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I ♦ 

BORRELID IN- PRODUCING POIiYKETIDE SYNTHASE AND ITS USES 



FIELD OF THE INVENTION 

The present invention relates to materials and methods for the 
5 preparation of polyketides, i.e. structures notionally derived 
by reduction and/or other processing or modification of one or 
more ketide units. Enzyme systems , nucleic acids , vectors and 
cells are provided for the preparation of polyketides, and in 
particular the polyketide macrolide borrelidin. 

10 

BACKGROUND TO THE INVENTION 

Polyketides are natural products produced by a wide range of 
organisms, and particularly by microorganisms. Polyketides have 
many important pharmaceutical, veterinary and agricultural uses. 
15 Polyketides encompass a huge range of chemical structural space, 
and have a wide range of associated biological activities. 
Polyketides with use in medical treatments include antibiotics, 
immunosuppressants, antitumor agents, other chemotherapeutic 

» 

agents, and other compounds possessing a broad range of 
20 therapeutic and biological properties. The Gram-positive 

bacteria Streptomyces and their allied genera are prodigious 
producers of polyketides, and the genetics and biochemistry of 
polyketide biosynthesis in these organisms are relatively well 
characterised (Hopwood, 1997) . The genes for polyketide 
25 biosynthesis in Streptomyces are clustered and the exploitation 
of DNA technology has made it possible to isolate complete 
biosynthetic gene clusters by screening gene libraries with DNA 

« 

probes encoding the genes responsible for their biosynthesis. 
Thus, increasing numbers of gene clusters for polyketide 

30 biosynthesis in Streptomyces and other microorganisms have been 
isolated and sequenced, including, for example, those for the 
polyether monensin (WO 01/68867), the polyene nystatin (WO 
01/59126) and for rapamycin (Schwecke et al., 1995). 

Polyketides are synthesised through the repeated 

35 condensation of building blocks that contain a carboxylic acid 
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function. At each stage of the process this results in the 
formation of a new p-keto function and an ar-side chain branch 
into the growing chain* The structural diversity of polyketides 
derives from a number of aspects of their biosynthetic pathway 
including: the wide variety of starter units that may be 
utilised in their biosynthesis; the different lengths of 
polyketide chains that are possible; the various a-side chains 
that are introduced either during or after assembly of the 
polyketide chain; the various (3 -substitutions that may be 
introduced during or after assembly of the polyketide chain; the 
various degrees of processing that the p -keto groups can 
undergo (keto., hydroxyl, enoyl, and methylene); and the various 
stereochemistries that are possible at the a- and p -centres. 

The synthesis of polyketides is catalysed by an enzyme, or 
by a complex of enzymes, called the polyketide synthase (PKS) in 
a manner similar to that of fatty acid biosynthesis. - 
Streptomyces and related genera PKSs fall into three main 
categories: type-I, type-Hand type-Ill. The type-III PKSs are 
small proteins related to plant chalcone synthases that have 
been discovered only recently (Moore & Hopke, 2000) . Type-III 
systems have been implicated in the biosynthesis of a small 
number of secondary metabolites but may be more generally 
involved in the biosynthesis of soluble pigments. The type-II 
PKSs consist of several mono functional proteins that act as a 
multi-polypeptide complex. Simple aromatic polyketides such as 
actinorhodin are formed by several rounds of chain assembly, 
which are performed iteratively on one set of type-II PKS 

* 

enzymes that are encoded for by one set of PKS genes (Hopwood, 
1997) . Type-I PKSs are multifunctional proteins and are required 
for the synthesis of more complex polyketides such as 
erythromycin and rapamycin. As the focus of this patent, type-I 
PKS organisation and function are described in detail below: 

Type-I PKSs are organised into modules, whereby each 
module consists of several catalytic * domains' that are required 
to carry out one round of chain assembly (Staunton & Wilkinson, 



1997). In general a modular PKS contains . the correct number of 
modules (loading plus extension modules) to select and condense 
the correct number of loading and extension units. For example 
the erythromycin PKS consists of 7 modules (one. loading and six 
extension modules) to select and condense the one starter and 
six extension units required for the biosynthesis of the 
erythromycin precursor 6-deoxyerythronolide B. Thus, there 
exists a one to one relationship between the number of modules 
present in the PKS and the number of units incorporated. This 
one to one relationship is described as * co-linearity ' . 

The term ^extension module' as used herein refers to the 
set of contiguous domains, from the p-ketoacyl-acyl carrier 
protein synthase (KS) domain to the next acyl carrier protein 
• (ACP) domain, which accomplishes one cycle of polyketide chain 
extension. The term ^loading module' as used herein refers to 
any group of contiguous domains that accomplishes the loading of 
the starter unit onto the PKS and thus renders it available to 
the KS domain of the first extension module. Besides 
condensation of the next extender carboxylic acid (or ketide) 
unit onto the growing polyketide chain, which is performed by 
the catalytic activity of the essential KS domain, modules of 
type-I PKSs may contain domains with p-ketoreductase (KR) , 
dehydratase (DH) , and enoyl reductase (ER) activities which are 
responsible for the further processing of the newly formed p- 
keto groups during chain extension. The acyl transferase (AT) 
and the ACP domains present in each module are responsible for 
the choice of extender unit, and the tethering of the growing 
chain during its passage on the PKS respectively. The completed 
polyketide chain is generally released from PKSs by the action 
of a terminal thioesterase (TE) domain that is also generally 
involved in the cyclisation (lactonisation) of the final 
product. Other chain terminating/cyclising strategies are also 
employed such as that for the addition of an amino acid residue 
and macrolactam formation as observed for rapamycin (Schwecke et 
al., 1995), for macrolactam formation as for rifamycin (August 



et al., 1998), and for amino acid incorporation followed by 
reductive elimination as for myxalamid biosynthesis (Silakowski 
et al., 2001). In summary, there is a single enzymatic domain 
present for each successive catalytic step that occurs during 
5 biosynthesis on the PKS, and they are used in defined sequence 
that depends upon their location within the protein and the 
particular function they perform. This mechanism is termed 
*processive f . 

The modular arrangement of type-I PKSs was first confirmed 
10 by mutation of the erythromycin PKS (also known as 6- 

deoxyerythronolide B synthase, DEBS) through an in-frame 
deletion of a region of the KR domain of module 5 (Donadio et 
al., 1991). This led to the production of the erythromycin 
analogues, 5 f 6-dideoxy-3-a-mycarosyl-5-oxoerythronolide B and 
15 5, 6-dideoxy-5-oxoerythronolide B, due to the inability of the 

mutated KR domain to reduce the p-keto group 5 at this stage of 
processive biosynthesis. Likewise, alteration of the active site 
residues in the ER domain of module 4 of DEBS 2, by genetic 
engineering of the corresponding PKS-encoding DNA and its 
20 introduction into Saccharopolyspora erythraea, led to the 

production of 6, 7-anhydroerythromycin C (Donadio et al., 1993). 
In addition, the length of the polyketide chain formed by DEBS 
has been altered through the specific relocation of the TE 
domain of DEBS3 to the end of DEBS1; the expected triketide 
25 lactone product was produced in good yield (Cortes et al., 

1995) . It should be noted that the changes described involved 
modification by deletion of sequence, or. by sequence specific 
inactivation, or by the alternative juxtaposition of DNA 
sequence from within the same PKS cluster (ie. they are 
30 considered y homologous changes') . Other such homologous' 

changes to the erythromycin PKS are described in WO 93/13663. 

The modular organisation of type-I PKS genes lends itself 
to their manipulation to produce altered polyketide structures. 
Type I PKSs represent an assembly line for polyketide 
35 biosynthesis that can be manipulated by changing the number of 
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modules; by changing their specificities towards different 
carboxylic acid starter units and extender units; by 
inactivating, mutating, removing, swapping or inserting domains 
with different activities and specificities; and by altering the 
5 chain or ring size through the repositioning of termination or 
cyclisation domains (Staunton & Wilkinson, 1997) . 

WO 98/01546 describes the production of hybrid PKS gene 
assemblies comprising the incorporation of heterologous DNA. WO 
98/01546 describes methods for generating hybrid PKSs in which 

10 the substitution of genes encoding heterologous modules, sub- 
modules or domains for the native genes generates novel 
polyketides with altered structures. Specifically, for example 
the AT domains of heterologous DNA from the rapamycin or 
monensin PKSs can be exchanged for that native to the 

15 erythromycin PKS in order to generate novel polyketides with 

altered alkyl branching- Such an AT domain swap represents the 
first example of the production of a truly hybrid PKS (Oliynyk 
et aim, 1996). WO 98/01546 also describes in general terms the 
production of hybrid PKS assemblies comprising a loading module 

20 and at least one extension module. It specifically describes the 
construction of a hybrid PKS gene assembly by grafting the 
broad-specificity loading module for the avermectin-producing 
PKS onto the first protein of the erythromycin PKS (DEBS1) in 
place of the normal loading module (see also Marsden et al., 

25 1998) . Additional examples comprising loading module swaps that 
are substrate specific have also been described (WO 00/00618; US 
5876991; Kuhstoss et al., 1996). WO 00/01827 describes methods 
for varying the p-keto processing capability of a PKS module 
through the ability to swap ^reductive loops', ie. the ability 

30 to rapidly and in a combinatorial manner, alter the number and 
type of ketoreductase, dehydratase and enoyl reductase domains 
within a module. In addition to changing the level of p-keto 
group processing, such changes may also lead to changes in 
stereochemistry of the oc-alkyl and p-hydroxyl groups thus formed 

35 by the altered modules. 
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Although modular PKSs operate 'normally' in a co-linear 
and processive manner as described above, examples of a 
deviation from this mode of operation have been described and 
are discussed below: 
5 The picromycin PKS gene cluster in Streptomyces Venezuela e 

is responsible for the biosynthesis of both picromycin (a 14- 
membered, heptaketide macrolide) and methymycin (a 12-membered, 
hexaketide macrolide) (Xueet al., 1998). The ability of a 
single PKS to produce two related macrolides, of different ring 
10 sizes, derives from the alternative expression of the final PKS 
gene p±kA4 (Xue & Sherman, 2000) . When 'normal' expression 
occurs and full-length PikA4 is formed, a sixth extension unit 
is incorporated and the picromycin aglycone is produced; when 
alternative expression occurs and an N-terminally truncated form 
15 of PikA4 is produced, no sixth extension unit is incorporated 
and the growing polyketide chain is passed directly to the TE 
domain which leads to formation of the methymycin aglycone. 
Thus, a breakdown of co-linearity occurs and a 'ring contracted' 
product is formed. The biochemical basis for this phenomenon has 
20 been investigated and shown to be an ACP5 to ACP6 transfer, 

missing out covalent attachment to the intervening KS6 domain; 
such a breakdown of co-linearity has been called 'skipping' 
(Beck et al. , 2002) . 

Skipping has also been observed to occur when an extra 
25 extension module from the rapamycin PKS was interpolated into 

the erythromycin PKS in order to convert the natural heptketide- 
producing PKS into an octaketide-producing one (Rowe et al., 
2001) . The expected octaketide, 16-membered macrolide was 
produced, but the major product was the normal heptaketide 
30 product 6-deoxyerythronolide. This 'skipping' of the 

interpolated module is believed to occur due to the interpolated 
module acting on some occasions as a 'shuttle' , passing the 
growing chain from the preceding module to the following 
downstream module without performing a round of chain extension. 
35 It was subsequently shown that the ACP domain of the 
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interpolated module is essential in passing the; growing 
polyketide chain from the preceding ACE domain and passing it to 
the KS domain of the following module during skipping (Thomas et 
al., 2002), a mechanism similar to that described for methymycin 
5 biosynthesis above. It is shown that skipping can occur without 
the active site nucleophile of the KS domain. A ring-contracted 
(skipped) nemadectin (an antiparasitic macrolide) has been 
reported from a mutant of a Stxeptomyces soil isolate that was 
modified by chemical mutation (Rudd et al., 1990); the 
10 biosynthesis of the natural PKS product was abolished. 

An alternative manner in which modular PKSs deviate from 

■ 

co-linear operation involves the iterative operation of modules. 
For example, module 4 of the erythromycin PKS appears to operate 
iteratively, at a low level, to produce a ring expanded 16- 

15 membered, octaketide macrolide related to 6-deoxyerythronolide B 
(Wilkinson et al., 2000). The ability of the erythromycin PKS to 
perform this operation has been termed ^stuttering' . The 
'stuttering' of the erythromycin PKS is considered an aberrant 
process, as the products of this stuttering are formed in low 

20 yield and the major product of the erythromycin PKS is the 

normal heptaketide 6-deoxyerythonolide B formed by co-linear 
operation. Products that appear to be formed by both stuttering 
and skipping have also been reported as minor components from 
the epothilone producer Sorangium cellulosum (Hardt et al., 

25 2001) . The stigmatellin biosynthetic cluster of Stigmatella 

aurantlaca encodes for a PKS that comprises ten (one loading and 
nine extension) modules (Gaitatzis et al., 2002); however, based 
on results . from structural elucidation and the feeding of stable 
isotope labelled substrates, stigmatellin is formed from eleven 

30 modular derived units. Thus, it would appear that one of the 
stigmatellin PKS modules operates (twice) iteratively. 

Additional structural diversity can be generated through 
the modification of polyketides by enzymes other that the PKS, 
either during the process of chain assembly as seen during the 

35 biosynthesis of some ansamycins (Floss, 2001) , or after the 
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process of chain assembly following release from the PKS. Such 
non-PKS mediated reactions may include, but are not limited to 
the following: reduction, oxidation, hydroxylation, acylation, 
alkylation, amination, decarboxylation, dehydration, double bond 
isomerisation/migration, cyclisation, ring cleavage, 
conjugation, glycosylation, reductive elimination and any 
combination of these. When these reactions occur after chain 
assembly they are termed the post-PKS or tailoring steps. Such 
tailoring steps are generally, but not always, essential for 
endowing the polyketide natural product with biological 
activity. 

In addition, the structural diversity of polyketides 
obtainable biosynthetically can be further enhanced through the 
use of defined -heterologous post-PKS tailoring enzymes as well 
15 as through the use of those which naturally modify the natural 
polyketide (Gaisser et al., 2000). WO 01/79520 describes the 
heterologous modification of polyketide macrolide structures 
through glycosylation, epoxidation, hydroxylation, and 
methylation. The ability to generate analogues of the 
20 agricultural compound spinosyn through glycosylation with 

alternative deoxyhexose substituents has been reported (Gaisser 

et al., 2002) . 

Borrelidin 1 (figure 1) is an 18-membered macrolide 
produced by several bacterial strains including, but not limited 

25 to, Streptomyces rochei ATCC23956, Streptomyces parvulus TU113 
and Streptomyces parvulus TU4055. The gross structure of 
borrelidin was first elucidated in 1967 (Keller-Scheirlein, 
1967) , and was subsequently refined by detailed NMR analysis 
(Kuo et al., 1989). The absolute configuration of borrelidin was 

30 confirmed by X-ray crystallography (Anderson et al., 1989). Its 
co-identity as the antibiotic treponemycin has been verified 

(Maehr & Evans, 1987) . 

Borrelidin was first discovered due to its antibacterial 
activity (Berger et al., 1949), although this antibacterial 
35 activity extends only to a limited number of micrococci, and is 
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not found against all common test bacteria. The mode of action . 
in sensitive microorganisms involves selective inhibition of 
threonyl tRNA synthetase (Paetz & Nass, 1973) . Other activities 
against spirochetes of the genus Treponema (Singh et al., 1985; 
5 US 4,759,928), against viruses (Dickinson et al., 1965), uses 
for the control of animal pests and weeds (DE 3607287) and use 
as an agricultural fungicide (DE 19835669; US 6,193,964) have 
been reported. Between all of these reports only two reported 
any synthetically modified derivatives. The first of these 

10 describes the benzyl ester and its bis-O- (4-nitrobenzoyl) 

derivative (Berger et al., 1949). ; The second of these describes 
the borrelidin methyl ester, the methyl ester bis O-acetyl 
derivative, and the methyl ester zii 4 _ 15 -dihydro-, Ai 4 _i 5 , 12-13- 
tetrahydro-, and An-is, 12-13~tetrahydro-Ci2-a1rri.no derivatives 

15 (Anderton & Rickards, 1965) . No biological activity was reported 
for any of these compounds . 

A more recent disclosure of significance is the discovery 
that borrelidin displays anti-angiogenesis activity (Wakabayashi 
et-al., 1997). Angiogenesis is the process of the formation of 

20 new blood vessels. Angiogenesis occurs only locally and 

transiently in adults, being involved in, for example, repair 
following local trauma and the female reproductive cycle. It has 
been established as a key component in several pathogenic 
processes including cancer, rheumatoid arthritis and diabetic 

25 retinopathy. Its importance in enabling tumours to grow beyond a 
diameter of 1-2 cm was established by Folkman (Folkman, 1986), 
and is provoked by the tumour responding to hypoxia. In its 
downstream consequences angiogenesis is mostly a host-derived 
process, thus inhibition of angiogenesis offers significant 

30 potential in the treatment of cancers, avoiding the hurdles of 

other anticancer therapeutic modalities such as the diversity, of 
cancer types and drug resistance (Matter, 2001) . It is of 
additional interest that recent publications have described the 
functional involvement of tyrosinyl- and tryptophanyl tRNA 
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synthetases in the regulation of angiogenesis (Wakasugi et al., 
2002; Otani et al., 2002). 

In the rat aorta matrix culture model of angiogenesis, 
borrelidin exhibits a potent angiogenesis-inhibiting effect and 
also causes disruption of formed capillary tubes in a dose 
dependent manner by inducing apoptosis of the capillary- forming 
cells (Wakabayashi et al., 1997). Borrelidin inhibited capillary 
tube formation with an IC 5 o value of 0.4 ng/ml (0.8 nM) . In the 
same study, borrelidin was shown to possess anti-prolif erative 
activity towards human umbilical vein . endothelial cells (HUVEC) 
in a cell growth assay; the IC 50 value was measured at 6 ng/ml, 
which is 15-fold weaker than the anti-angiogenesis activity 
measured in the same medium. This anti-proliferative activity of 
borrelidin was shown to be general towards various cell lines. 
In addition to these data the authors report that borrelidin 
inhibits tRNA synthetase and protein synthesis in the cultured 
rat cells; however the IC 50 value for anti-angiogenesis activity 
(0.4 ng/ml) was 50-fold lower than that reported for inhibition 
of protein synthesis (20 ng/ml) f indicating different activities 

of the compound. 

Borrelidin also displays potent inhibition of angiogenesis 
in vivo using the mouse dorsal air sac model (Funahashi et al., 
1999) , which examines VEGF-induced angiogenesis and is an 
excellent model for studying tumour-angiogenesis . Borrelidin was 
administered at a dose of 1.8 mg/kg by intraperitoneal injection 
and shown to significantly reduce the increment of vascular 
volume induced by WiDr cells, and to a higher degree than does 
TNP-470, which is a synthetic angiogenesis inhibitor in clinical 
trials. Detailed controls verified that these data are for 
angiogenesis inhibition and not inhibition of growth of the 
tumour cells. The authors also showed that borrelidin is 
effective for the inhibition of the formation of spontaneous 
lung metastases of B16-BL6 melanoma cells at the same dosage by 
inhibiting the angiogenic processes involved in their formation. 
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JP 9-227,549 and JP 8-173,167 confirm that borrelidin is 
effective against WiDr cell lines of human colon cancer, and 
also against PC-3 cell lines of human prostate cancer. JP 9- 
227,549 describes the production of borrelidin by Streptomyces 
rochei Mer-N7167 (Ferm P-14670) and its isolation from the 
resulting fermentation culture. In addition to borrelidin 1, the 
12-desnitrile-12-carboxyl borrelidin 2 (presumably a 
biosynthetic intermediate or shunt metabolite) , the C10- 
desmethyl borrelidin 3 (presumably a biosynthetic analogue 
arising from the mis-incorporation of an alternative malonyl-CoA 
extender unit in module 4 of the borrelidin PKS) , the CIO- 
epiborrelidin 4 and the C14,C15-cis borrelidin analogue 5 were 
described (see figure 1). Thus, JP 9-227,549 specifies 
borrelidin and borrelidin analogues wherein a nitrile or 
carboxyl group is attached the carbon skeleton at C12, and a 
hydrogen atom or lower alkyl group is attached to the carbon 
skeleton at CIO. 

WO 01/09113 discloses the preparation of borrelidin 
analogues that have undergone synthetic modification at the 
carboxylic acid moiety of the cyclopentane ring. The activity of 
these compounds was examined using endothelial cell 
proliferation and endothelial capillary formation assays in a 
similar manner to that described above. In general, modification 
of the carboxyl moiety improved the selectivity for inhibiting 
capillary formation: the major reason for this improvement in 
selectivity is through a decrease in the cell proliferation 
inhibition activity whereas the capillary formation inhibitory 
activity was altered to a much lower degree. Specifically, the 
borrelidin-morpholinoethyl ester showed a 60-fold selectivity 
index, the borrelidin-amide showed a 37-fold selectivity index, 
the borrelidin- (2-pyridyl) -ethyl ester showed a 7.5-fold 
selectivity index and the borrelidin-morpholinoethyl amide 
showed a 6-fold selectivity index, for the capillary formation 
inhibitory activity versus cell proliferation with respect to 
borrelidin. The capillary formation inhibitory activity of these 
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and other borrelidin derivatives was verified using a micro- 
vessel formation assay. In addition, the authors showed that 
borrelidin weakly inhibited the propagation of metastatic 
nodules, after removal of the primary tumour, when using a Lewis 
5 lung adenocarcinoma model. However, the borrelidin- (3- 
picolylamide) derivative was reported to inhibit very 
considerably the increase of micrometastases in rats after 
intraperitoneal and also with per os administration at subtoxic 

4 

doses. Similarly, using the colon 38 spleen liver model, the 
10 metastasis-forming ability of mouse colon adenocarcinoma cells 
transplanted into mouse spleen was considerably decreased after 
treatment with a subtoxic dose of this borrelidin derivative. 
These data confirm the earlier reported ability of borrelidin 
and its derivatives to inhibit the formation of metastases . 
15 Borrelidin has also been identified as an inhibitor of 

cyclin-dependant kinase Cdc28/Cln2 of Saccharomyces cerlvisiae 
with an IC 50 value of 12 pg/ml (24 pM) (Tsuchiya et al., 2001). 
It was shown that borrelidin arrests both haploid and diploid 
cells in late Gi phase (at a time point indistinguishable from 
20 a-mating pheromone) , and at concentrations that do not affect 
gross protein biosynthesis. These data were taken to indicate 
that borrelidin has potential as a lead compound to develop 

anti-tumour agents . 

In summary, the angiogenesis-inhibitory effect of 
25 borrelidin is directed towards the twin tumour-biological 

effects of proliferation and capillary formation. In addition, 
borrelidin, and derivatives thereof, have been shown to inhibit 
the propagation of metastases. Borrelidin also has indications 
for use in cell cycle modulation. Thus, borrelidin and related 
30 compounds are particularly attractive targets for investigation 
' as therapeutic agents for the treatment of tumourous tissues, 
either as single agents or for use as an adjunct to other 
therapies. In addition, they may be used for treating other 
diseases in which angiogenesis is implicated in the pathogenic 
35 process, including, but not restricted to, the following list: 
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rheumatoid arthritis, psoriasis, atherosclerosis, diabetic 
retinopathy and various ophthalmic disorders . 

SUMMARY OF THE INVENTION 

The present invention provides the entire nucleic acid 
sequence of the biosynthetic gene cluster responsible for 
governing the synthesis of the polyketide macrolide borrelidin 
in Streptomyces parvulus TU4055. Also provided is the use of all 
or part of the cloned DNA in the specific detection of other 
polyketide biosynthetic gene clusters, in the engineering of 
mutant strains of Streptomyces parvulus and other suitable host 
strains for the production of enhanced levels of borrelidin, or 
for the production of novel polyketides, and of recombinant 
genes for the biosynthesis of novel polyketide products . 

The present invention provides an isolated nucleic acid 
molecule comprising all or part of a borrelidin biosynthetic 
gene cluster, or a portion thereof. 

The complete nucleotide sequence of the borrelidin 
biosynthetic gene cluster from Streptomyces parvulus Tu4055 is 
shown in SEQ ID No . 1 . Its organisation is presented in figure 3 
and comprises genes and open reading frames designated 
hereinafter as: borAl, borA2, borA3 f borA4 f borA5, borA6, borB, 
borC, borD, borE, borF, borG, borH, borl, borJ, borK, borL, 
borM, borN, borO, orfBl, orfB2 r orfB3 r orfB4, orfB5, orfB6, 
orfBl, orfBB, orfB9, or f BIO, orfBl 1, orfB12, orfBl 3, orfBl 4, 
orfBIS, orfB16, orfB17, orfB18, orfB19, orfB20, orfB21 and 
orfB22 . 

The proposed functions of the cloned genes are described 
in Figures 4 (proposed biosynthesis of the starter unit) , 5 
(organisation of the borrelidin PKS and biosynthesis of pre- 
borrelidin) and 6 (introduction of the C12~nitrile moiety) and 
are described below. 

The present invention thus provides an isolated nucleic 
acid molecule comprising: 
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(a) a nucleotide sequence as shown in SEQ ID No.l, or a portion 
or fragment thereof; or 

(b) a nucleotide sequence which is the complement of SEQ ID 
No.l, or a portion or fragment thereof; or 

(c) a nucleotide sequence which is degenerate with a coding 
sequence of SEQ ID No.l, or a portion or fragment thereof. 

A preferred portion or fragment of SEQ ID NO:l is the 
sequence extending between nucleotide positions 7603 and 59966 

of SEQ ID No.l. 

The sequence may encode or be complementary to a sequence 
encoding a polypeptide of a polyketide biosynthetic gene 
cluster, or a portion thereof. A polyketide biosynthetic gene 
cluster is a segment of DNA comprising a plurality of genes 
encoding polypeptides having activity in the biosynthesis of a 
polyketide or macrolide moiety. This is not restricted to 
components of the polyketide synthase (PKS) which function Inter 
alia in the synthesis of the polyketide backbone and reductive 
processing of side groups, but also encompasses polypeptides 
having ancillary functions in the synthesis of the polyketide. 
Thus polypeptides of the biosynthetic gene cluster may also act 
in macrolide ring or polyketide chain modification (e.g. 
catalysing a reaction in the formation of the C12 nitrile moiety 
of borrelidin) , in the synthesis of a precursor or starter unit 
for a polyketide or macrolide moiety (e.g. catalysing a reaction 
in the synthesis of the trans-cyclopentane-1, 2-dicarboxylic acid 
starter unit for the borrelidin PKS, or responsible for the 
activation of such molecules* as the coenzyme-A thioesters of the 
starter and extender units of the chain) , regulatory activity 
(e.g. regulation of the expression of the genes or proteins 
involved in polyketide or macrolide synthesis) , transporter 
activity (e.g. in transport of substrates for the polyketide or 
macrolide moiety into the cell, or of synthesis products such as 
the polyketide or macrolide molecule out of the cell) , and in 
conferring resistance of the producing cell to the synthesised 
products (e.g. through specific binding to the synthesised 
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molecule, or as a replacement for other endogenous proteins to 
which the synthesised molecule may bind within or outside of the 
cell) • 

The gene cluster also includes non-coding regions, such as 
5 promoters and other transcriptional regulatory sequences which 
are operably linked to the coding regions of the gene cluster. 
The skilled person is well able to identify such elements, based 
upon the inf ormation provided herein, and these are within the 
scope of the present invention. 

10 Genes and open reading frames encoded within SEQ ID No.l 

represent preferred parts or fragments of SEQ ID No.l. Thus an 
isolated nucleic acid molecule may comprise a sequence that 
encodes a polypeptide from a borrelidin biosynthetic gene 
cluster, wherein said polypeptide has an amino acid sequence 

15 selected from the group consisting of SEQ ID Nos.2 to 44. 

In preferred embodiments, the nucleic acid sequence 
comprises an open reading frame selected from the group of open 
reading frames of SEQ ID NO: 1 consisting of borAl, borA2 r 
borA3, borA4, borAS, borA6, borB, borC, borD, borE, borF, borG, 

20 borti, borl, borJ, borK, borL, borM, borN, borO, orfBl, orfB2 r 

orfB3 r orfB4 f orfB5, orfB6 r orfBl, orfBS, orfB9 f or f BIO, orfBll, 
orfB12, orfB13a, orfB13b, orfBl4, orfBIS, orfBl6, orfBll, 
orfBl 8, orfBl 9, orfB20, orfB21 and orfB22, said open reading 
frames being described by, respectively, bases 16184*-18814 , 

25 18875-23590, 23686-34188, 34185*-39047 , 39122*-45514, 45514- 

50742, 7603-8397c, 8397-9194c, 9244-9996c, 9993-11165c, 11162- 
11980c, 11992-13611c, 13608-15659*c, 50739*-52019, 52113-53477, 
53486-54466, 54506-56176, 56181*-57098 , 57112-57858, 57939- 
59966, 2-313 (incomplete), 501*-3107, 3172-3810c, 3935-4924c, 

30 5123-5953, 5961-6518*c, 6564*-7538, 60153-r60533*c, 60620-61003, 
61188*-61436, 61526-61738, 61767-62285c, 62750-63067c, 62586- 
62858c, 63155-65071c, 65374-65871, 65942-68305*c, 68290-68910*c, 
69681-70436, 70445-71848, 71851-72957, 73037-73942 and 73995- 
74534c of SEQ ID No.l. 
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In the above list, y C indicates that the gene is encoded 
by the complementary strand to that shown in SEQ ID NO: 1. Each 
open reading frame above represents the longest probable open 
reading frame present. It is sometimes the case that more than 
5 one potential start codon can be identified. One skilled in the 
art will recognise this and be able to identify alternative 
possible start codons. Those genes which have more than one 
possible start codon are indicated with a symbol.] 

It should be noted that a number of these open reading 
10 frames begin with a codon (GTG, CTG or TTG) other than the more 
normal ATG initiation codon. It is well known that in some 
bacterial systems such codons, which normally denote valine 
(GTG) or leucine (CTG, TTG), may be read as initiation codons 
encoding methionine at the N terminus of the polypeptide chain. 
15 In the amino acid sequences (SEQ ID Nos: 2 to 44) provided 
herein, such codons are therefore translated as methionine. 

Also provided are nucleic acid molecules encoding portions 
of the open reading frames identified herein. For example, such 
a nucleic acid sequence may comprise one or more isolated 
20 domains derived from the open reading frames identified herein. 
The polypeptides encoded by these isolated portions of the open 
reading frames may have independent activity, e.g. catalytic 
activity. In particular, the polypeptides which make up the 
borrelidin PKS have modular structures in which individual 
25 domains have particular catalytic activities as set out above. 
Thus any of these domains may be expressed alone or in 
combination, with other polypeptides from the borrelidin PKS 
described herein or domains thereof, or with polypeptides from 
the PKS of other polyketides. 
30 The term 'PKS domain' as used herein refers to a 

polypeptide sequence, capable of folding independently of the 
remainder of the PKS, and having a single distinct enzymatic 
activity in polyketide or macrolide synthesis including, but not 
restricted to p-ketoacyl-acyl carrier protein synthase (KS) , 
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acyl carrier protein (ACP) , acyl transferase (AT) , 0- 
ketoreductase (KR) , dehydratase (DH) , enoyl reductase (ER) or 
terminal thioesterase (TE) . 

Accordingly, the invention further provides: 
5 (a) an isolated nucleic acid molecule comprising a sequence that 
encodes a PKS domain selected from ATO and ACPO, said domains 
being described by, respectively, amino acids 322-664 and 694- 
763 of SEQ ID No. 2. In a preferred embodiment, the PKS domain 
comprises a sequence selected from the group consisting of bases 
10 17147-18175 and 18263-18472 of SEQ ID No.l; 

(b) an isolated nucleic acid molecule comprising a sequence that 
encodes a PKS domain selected from KS1, ATI, KM and ACPI, said 
domains being described by, respectively, amino acids 34-459, 
557-885, 1136-1379 and 1419-1486 of SEQ ID No. 3. In a preferred 

15 embodiment, the PKS domain comprises a sequence selected from 
the group consisting of bases 18974-20251, 20543-21529, 22280- 
23011 and 23129-23332 of SEQ ID No.l; 

(c) an isolated nucleic acid molecule comprising a sequence that 
encodes a PKS domain selected from KS2, AT2 , DH2, KR2, ACP 2, 

20 KS3, AT3, DH3, KR3 and ACP 3, said domains being described by, 

respectively, amino acids 34-459, 559-887, 903-1050, 1354-1597, 

* 

1628-1694, 1724-2149, 2245-2576, 2593-2734, 3060-3307 and 3340- 
3406 of SEQ ID No. 4. In a preferred embodiment, the PKS domain 
comprises a sequence selected from the group consisting of bases 
25 23785-25062, 25360-26346, 26392-26835, 27745-28476, 28567-28767, 
28855-30132, 30418-31413, 31462-31887, 32863-33606 and 33703- 
33903 of SEQ ID No.l; 

(d) an isolated nucleic acid molecule comprising a sequence that 
encodes a PKS domain selected from KS4, AT4, KR4 and ACP 4 , said 

30 domains being described by, respectively, amino acids 34-459, 

555-886, 1179-1423 and 1459-1525 of SEQ ID No. 5. In a preferred 
embodiment, the PKS domain comprises a sequence selected from 
the group consisting of bases 34284-35561, 35847-36842, 37719- 
38453 and 38559-38759 of SEQ ID No.l; 
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(e) an isolated nucleic acid molecule comprising a sequence that 
encodes a PKS domain selected from KS5, ATS, DH5, ER5, KR5 and 
ACP5, said domains being described by, respectively, amino acids 
34-457, 553-888, 905-1046, 1401-1690, 1696-1942 and 1975-2041 of 
SEQ ID No, 6. In a preferred embodiment, the PKS domain comprises 
a sequence selected from the group consisting of bases 39221- 
40492, 40778-41785, 41834-42259, 43322-44191, 44207-44947 and 
45044-45244 of SEQ ID No.l; 

(f ) an isolated nucleic acid molecule comprising a sequence that 
encodes a PKS domain selected from KS6, AT6, KR6, ACP6 and TE, 
said domains being described by, respectively, amino acids 37- 
457, 555-883, 1101-1335, 1371-1437 and 1461-1708 of SEQ ID No. 7. 
In a preferred embodiment, the PKS domain comprises a sequence 
selected from the group consisting of bases 45622-46884, 47176- 
48162, 48814-49518, 49624-49824 and 49894-50637 of SEQ ID No.l. 

In another of its aspects the invention provides an 
isolated nucleic acid molecule comprising a sequence that 
encodes a PKS module, said module being selected from the group 
consisting of amino acids 322-763 of SEQ ID No. 2, 34-1486 of SEQ 
ID No. 3, 34-1694 of SEQ ID No. 4, 1724-3406 of SEQ ID No. 4, 34- 
1525 of SEQ ID No. 5, 34-2041 of SEQ ID No . 6 and 37-1437 or 1708 
of SEQ ID No. 7. In a preferred embodiment, the module comprises 
a sequence selected from the group consisting of bases 17147- 
18472, 18974-23332, 23785-28767, 28855-33903, 34284-38759, 
39221-45244, 45622-49824 or 50637 of SEQ ID No.l. 

The term 'module' as used herein refers to a single 
polypeptide comprising a plurality of PKS domains each having a 
single distinct enzymatic activity in polyketide or macrolide 
synthesis including, but not restricted to p-ketoacyl-acyl 
carrier protein synthase (KS) , acyl carrier protein (ACP) , p- 
ketoreductase (KR) , dehydratase (DH) , enoyl reductase (ER) or 
terminal thioesterase (TE) . 
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Also provided is a nucleic acid molecule encoding a 
polyketide synthase comprising a sequence encoding one or more 
of the domains or modules described above. 

The sequences provided herein provide means with which to 
manipulate and/or to enhance polyketide synthesis. Thus there 
is provided a method of modifying a parent polyketide synthase, 
comprising expressing a domain from a borrelidin polyketide 
synthase or a derivative thereof as described herein in a host 
cell expressing said parent polyketide synthase, such that the 
domain is incorporated into said parent polyketide synthase. 
The borrelidin PKS domain may be inserted in addition to the 
native domains of the. parent PKS f or may replace a native parent 
domain . 

The present invention further provides methods of 
modifying a parent borrelidin PKS. A donor domain may be 
expressed in a host cell expressing said parent borrelidin PKS. 
Additionally or alternatively, a domain of the parent PKS may be 
deleted or otherwise inactivated; e.g. a parent domain may 
simply be deleted, or be replaced by a domain from a donor PKS, 
or a domain from a donor PKS may be added to the parent. Where 
a domain is added or replaced, the donor domain may be derived 
from the parent synthase, or from a different synthase. 

These methods may be used to enhance the biosynthesis of 
borrelidin, to produce new borrelidin derivatives or analogues, 
or other novel polyketide or macrolide structures . The number 
and nature of modules in the system may be altered to change the 
number and type of extender units recruited, and to change the 
various synthase, reductase and dehydratase activities that 
determine the structure of the polyketide chain. Such changes 
can be made by altering the order of the modules that comprise 
the PKS, by the duplication or removal of modules that comprise 
the PKS, by the introduction of modules from heterologous 
sources, or by some combination of these various approaches. 

Thus domains or modules of the borrelidin PKS may be 
deleted, duplicated, or swapped with other domains or modules 
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from the borrelidin PKS, or from PKS systems responsible for 
synthesis of other polyketides (heterologous PKS systems), which 
may be from different bacterial strains or species. 
Alternatively domains or modules from the borrelidin PKS may be 
introduced into heterologous PKS systems in order to produce 
novel polyketide or macrolides. 

For example, a particular extender module may be swapped 
for one having specificity for a different extender unit (as 
described e.g. in WO98/01571 and WO98/01546), or mutated to 
display specificity or selectivity for a different extender unit 
e.g. as described below. Additionally or alternatively, 
swapping or mutation of domains or modules, such as the KR, DH 
and ER domains responsible for the processing of a given p-keto 
moiety, may be used to alter the level of reductive processing 
of an extender unit during polyketide synthesis. In a preferred 
embodiment the BorAS module may be introduced into a parent PKS 
to provide iterative addition of extender units to a polyketide 
backbone, e.g. expanding the ring size of a macrolide polyketide 
relative to that naturally produced by the parent PKS. 

The borrelidin loading module is the first PKS loading 
module to be identified having specificity for an alicyclic di- 
carboxylic acid starter unit. Thus this module or a derivative 
thereof may be used to introduce alicyclic starter units into 
heterologous polyketide synthases. This need not be restricted 
to use of trans-cyclopentane-1, 2, -dicarboxylic acid normally 
used as the borrelidin starter unit. The borrelidin loading 
module is herein shown also to be capable of directing 
incorporation of other starter units including trans- 
cyclobutane-l,2-dicarboxylic acid and DL-2-methylsuccinic acid. 
The borrelidin starter unit may also be modified in a borrelidin 
producing cell, or replaced by a heterologous loading module, to 
introduce alternative starter units into the borrelidin 

synthetic pathway. 

The position of the loading module of the PKS may be 
chosen (e.g. by fusing it to a particular location within the 
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PKS) in order to control the ring size of the resultant 
polyketide/macrolide molecules. 

The AT domains that determine the carboxylic acid-CoA 
thioester extender units may be deleted, modified or replaced. 
5 The ACP domains may also be deleted, modified or replaced. In 

addition domains that are not normally present in the borrelidin 
PKS but which are found in other modular PKS and/or mixed- 
PKS/NRPS systems may be inserted. Examples include, but are not 
limited to: O-methyl transferase domains, C-methyl transferase 
10 domains, epimerisation domains, monooxygenase domains and 
dehydrogenase domains . 

Further, the thioesterase domain of the borrelidin PKS may 
be altered or repositioned (e.g. fused to a chosen location 
within the PKS) in order to change its specificity and/or in 
15 order to release polyketide/macrolide molecules with a chosen 

ring size. Alternatively, heterologous thioesterase domains may 
be inserted into the borrelidin PKS to produce molecules with 
altered ring size relative to the molecule normally produced by 
the parent PKS, or to produce a free acid. 
20 In yet another alternative, the amino acid incorporating 

and macrolactam forming domains from mixed NRPS/PKS systems such 
as that for rapamycin, or for related systems such as for 
rifamycin biosynthesis and myxalamid biosynthesis, or modules 
from NRPS systems (such as those for bleomycin biosynthesis) may 
25 be inserted into the PKS to produce novel polyketide related 
molecules of mixed origin. 

The open reading frames encoding the PKS described herein 
may also comprise portions encoding non-enzymatically active 
portions which nevertheless have a functional role as scaffold 
30 regions which space and stabilise the enzymatically active 

domains and/or modules of the PKS at appropriate distances and 
orientations, and which may have recognition and docking 
functions that order the domains and modules of the PKS in the 
correct spatial arrangement. Thus the nucleic acid sequences of 
35 the present invention comprise sequences encoding such scaffold 
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regions, either alone or in combination with sequences encoding 
domains or modules as described above. 

In a further aspect, the present invention provides an 
isolated nucleic acid molecule comprising a sequence encoding a 
polypeptide, which catalyses a step in the synthesis of a starter 
unit or substrate for polyketide synthesis, preferably in the 
synthesis of the trans-cyclopentane-1, 2, -dicarboxylic acid 
moiety used as a starter unit by the borrelidin PKS. The 
polypeptide may have activity as a dehydrogenase, 3-oxoacyl-ACP- 
reductase, cyclase, F420 dependent dehydrogenase, or 2- 
hydroxyhepta-2,4~diene-l,7-dioate isomerase. Preferably the 
polypeptide comprises the sequence encoded by one of the group 
of genes consisting of borC, borD, borE, borF, borG r borH, bqrM 
and borN, as shown in SEQ ID NO: 8, 9, 10, 12, 13, 14, 19 or 20. 

These genes may be rendered deleted, disrupted, or 
otherwise inactivated in a borrelidin-producing cell in order to 
abolish borrelidin production. Cell lines resulting from such 
changes may be chemically complemented by the addition of 
exogenous carboxylic acids which may be incorporated in place of 

♦ 

the natural starter unit. Thus, new borrelidin related molecules 
may be synthesised, which are initiated from the exogenous ly fed 
carboxylic acid. Such an approach is termed mutasynthesis . The 
genes responsible for trans-cyclopentane-1, 2, -dicarboxylic acid 
synthesis may be introduced into a heterologous polyketide 
producer cell to allow that cell to synthesise the alicyclic 
dicarboxylic acid as a starter unit for its own PKS, 
Alternatively, genes responsible for the synthesis of the 
starter unit may be over-expressed in order to improve the 
fermentation titres of borrelidin or borrelidin related 
molecules. In another approach they may be modified, or 
replaced by other synthetic genes directing the production of 
altered carboxylic acids, leading to the production of 
borrelidin related molecules. These techniques may be 
complemented by the modification of the loading module of the 
PKS as described above. 
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In a further aspect, the present invention provides an 
isolated nucleic acid molecule comprising a sequence encoding a 
polypeptide which catalyses a step in the modification of a side 
chain of a polyketide moiety, for example in the conversion of a 
methyl group to a nitrile moiety, e.g. at C12 of pre-borrelidin 
(14) . The polypeptide may have activity as a cytochrome P450 
oxidase, amino transferase, or NAD/quinone oxidoreductase. 
Preferably the polypeptide comprises the sequence encoded by one 
of the group of genes consisting of JborJ, borJ, and borK as 
shown in SEQ ID NO: 15, 16 or 17. 

Various of these genes may be deleted/inactivated such 
that borrelidin-related molecules, or shunt metabolites thereof, 
accumulate which represent intermediate stages of the process 
that introduces the nitrile moiety. The addition of heterologous 
genes to such systems may allow alternative elaboration of any 
accumulated biosynthetic intermediates or shunt metabolites 
thereof. Alternatively, the genes may be mutated in order to 
alter their substrate specificity such that they function on 
alternative positions of pre-borrelidin molecules in order to 
provide borrelidin-related molecules. In addition, the genes 
responsible for formation of the nitrile group may be over- 
expressed in order to improve the fermentation titres of 
borrelidin or borrelidin-related molecules. 

Alternatively, one, some or all of these genes may be 
introduced into cells capable of producing other polyketides to 
provide for desired side chain processing of that polyketide, . 
e.g. the introduction of a nitrile moiety. This opens up the 
possibility of specific biosynthetic introduction of nitrile 
moieties into polyketides, particularly at side chains derived 
from methylmalonyl-CoA or ethylmalonyl-CoA extender units. 
Purified enzymes (see below) may also be used to effect the 
conversion of polyketide side chains to nitrile moieties in 
vitro. 

In a further aspect, the present invention provides an 
isolated nucleic acid molecule comprising a sequence encoding a 
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polypeptide involved conferring resistance to borrelidin. The 
polypeptide may have homology to a threonyl tRNA synthase, and 
preferably has threonyl tRNA synthase activity. Preferably the 
polypeptide comprises the sequence encoded by the JborO gene as 
shown in SEQ ID NO: 21. A resistance gene such as JborO, carried 
on a suitable vector (see below) may be used as a selective 
marker. Thus cells transformed with such a vector may be 
positively selected by culture in the presence of a 
concentration of borrelidin which inhibits the growth of f or 
kills, cells lacking such a gene. 

In a further aspect, the present invention provides an 
isolated nucleic acid molecule comprising a sequence encoding a 
polypeptide involved in regulation of expression of one or more 
genes of the borrelidin gene cluster. In a preferred embodiment 
the polypeptide comprises the sequence encoded by the borL gene 
as shown in SEQ ID NO: 18, or as encoded by orfB8 or orfB12 as 
shown in SEQ ID NO: 29 or 33. Regulator genes may be engineered 
to increase the titre of borrelidin and borrelidin derivatives, 
or borrelidin related molecules and their derivatives produced 
by fermentation of the resulting cell lines. For example, 
repressors may be deleted/inactivated, and/or activators may be 
up-regulated or overexpressed, e.g. by increasing gene copy 
number or placing the coding sequence under the control of a 
strong constitutively active or inducible promoter. The borL 
gene or a portion thereof may also find use as a hybridisation 
probe to identify similar regulator genes located in or outside 
other biosynthetic gene clusters. 

In a further aspect, the present invention provides an 
isolated nucleic acid molecule comprising a sequence encoding a 
polypeptide having type II thioesterase activity. In a 
preferred embodiment the polypeptide comprises the sequence 
encoded by the JborB gene as shown in SEQ ID NO: 8. This nucleic 
acid may be introduced into a host cell to modulate the titre of 
a polyketide synthesised by that cell. In particular, the titre 
may be increased by 'editing' of the products of unwanted side 
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reactions (e.g. removal of acyl groups formed by inappropriate 
decarboxylation of extender units attached to KS domains) . 
However in various aspects it may be desirable to remove such an 
activity from a producer cell, for example to increase the 
variety of polyketide products produced by that cell, or to 
facilitate production of an analogue of a naturally produced 
polyketide which would normally be blocked by such an editing 
activity. 

The nucleotide sequences of the invention may be portions 
of the sequence shown in SEQ ID NO: 1, or the complement 
thereof, or mutants, variants, derivatives or alleles of these 
sequences . The sequences may differ from that shown by a change 
which is one or more of addition, insertion, deletion and 
substitution of one or more nucleotides of the sequence shown. 
Changes to a coding nucleotide sequence may result in an amino 
acid change at the protein level, or not, as determined by the 
redundancy of the genetic code. Thus, nucleic acid according to 
the present invention may include a sequence different from the 
sequence shown in SEQ ID NO: 1 yet encode a polypeptide with the 
same amino acid sequence. Preferably mutants, variants, 
derivatives or alleles of the sequences provided encode 
polypeptides haying the same enzymatic activity as those 
described herein. 

Where the sequence is a coding sequence, the encoded 
polypeptide may comprise an amino acid sequence which differs by 
one or more amino acid residues from the amino acid sequences 
shown in SEQ ID Nos : 2 to 44. Nucleic acid encoding a 
polypeptide which is an amino acid sequence mutant, variant, 
derivative or allele of any of the sequences shown is further 
provided by the present invention. Such polypeptides are 
discussed below. Nucleic acid encoding such a polypeptide may 
show greater than about 60% identity with the coding sequence of 
SEQ ID NO: 1, greater than about 70% identity, greater than 
about 80% identity, or greater than about 90%, 91%, 92%, 93%, 
94%, 95%, 96%, 97%, 98% or 99% identity therewith. Percentage 
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identity may be calculated using one of the programs such as 
BLAST or BestFit from within the Genetics Computer Group (GCG) 
Version 10 software package available from the University of 
Wisconsin, using default parameters. 
5 In preferred embodiments, whether coding or non-coding, 

the nucleotide sequences of the invention are capable of 
hybridising specifically with at least a portion of the sequence 
of SEQ ID NO: 1 or the complement thereof . 

For example, hybridizations may be performed, according to 
10 the method of Sambrook et al. (Sambrook et al., 1989), using a 
hybridization solution comprising: 5X SSC, 5X Denhardfs 
reagent, 0.5-1.0% SDS, 100 ng/ml denatured, fragmented salmon 
sperm DNA, 0.05% sodium pyrophosphate and up to 50% formamide. 
Hybridization is carried out at 37-42°C for at least six hours. 
15 Following hybridization, filters are washed as follows: (1) 5 
minutes at room temperature in 2X SSC and 1% SDS; (2) 15 
minutes at room temperature in 2X SSC and 0.1% SDS; (3) 30 
minutes-1 hour at 37°C in IX SSC and 1% SDS; (4) 2 hours at 42- 
65°C in IX SSC and 1% SDS, changing the solution every 30 
20 minutes . 

One common formula for calculating the stringency 
conditions required to achieve hybridization between nucleic 
acid molecules of a specified sequence homology is (Sambrook et 

* 

al., 1989): 

25 T m - 81.5°C + 16.6Log [Na+] + 0.41 (% G+C) - 0.63 (% formamide) - 

600/#bp in duplex 

As an illustration of the above formula, using [Na+1 = 
[0.368] and 50% formamide, with GC content of 42% and an average 
probe size of 200 bases, the T m is 57 °C. The T m of a DNA duplex 
30 decreases by 1 - 1.5°C with every 1% decrease in homology. 
Thus, targets with greater than about 75% sequence identity 
would be observed using a hybridization temperature of 42 °C. 
Such hybridisation would be considered substantially specific to 
the nucleic acid sequence of the present invention. 
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The nucleic acids of the present invention preferably 
comprise at least 15 contiguous nucleotides of SEQ ID NO: 1. 
They may comprise 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 
100, 120, 150, 200, 300, 500 or more contiguous nucleotides of 
SEQ ID NO: 1. 

The nucleic acids may be used e.g. as primers or probes 
for the identification of novel genes or other genetic elements, 
such as transcriptional regulatory sequences, from polyketide or 
macrolide biosynthetic gene clusters, e.g. sequences encoding 
enzymes of the PKS, or domains or modules thereof, enzymes 
involved in the biosynthesis of a starter unit, enzymes 
modifying side chains of polyketide moieties, transporters, 
resistance genes and regulatory molecules as described. 

Thus the present invention provides a method of 
identifying a novel polyketide biosynthetic gene cluster, or a 
portion thereof, comprising hybridising a sample of target 
nucleic acid with a nucleic acid of the present invention 
capable of hybridising specifically to a nucleic acid having the 
sequence of SEQ ID NO: 1 or a portion thereof. ' The target 
nucleic acid may be any suitable nucleic acid, and is preferably 
bacterial genomic DNA. 

Typically/ the method further comprises the step of 
detecting hybridisation between the sample of nucleic acid and 
the nucleic acid of the invention. Hybridisation may be 
measured using any of a variety of techniques at the disposal of 
those skilled in the art. For instance, probes may be 
radioactively, f luorescently or enzymatically labelled. Other 
methods not employing labelling of probe include amplification 
using PCR, RNAase cleavage and allele specific oligonucleotide 
probing . 

A method may include hybridization of one or more (e.g. 
two) probes or primers to target nucleic acid. Where the 
nucleic acid is double-stranded DNA, hybridization will 
generally be preceded by denaturation to produce single-stranded 
DNA. The hybridization may be as part of a PCR procedure, or as 
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part of a probing procedure not involving PCR. An example 
procedure would be a combination of PCR and low stringency 
hybridization. A screening procedure, chosen from the many 
available to those skilled in the art, is used to identify 
5 successful hybridization events and isolated hybridized nucleic 

acid. 

Those skilled in the art are well able to employ suitable 
conditions of the desired stringency for selective 
hybridisation, taking into account factors such as 
10 oligonucleotide length and base composition, temperature and so 

on, as described above. 

An isolated nuclfesic acid molecule of the invention may be 

an isolated naturally occurring nucleic acid molecule (i.e. 

isolated or separated from the components with which it is 
15 normally found in nature) such as free or substantially free of 

nucleic acid flanking the gene in the bacterial genome, except 

possibly one or more regulatory sequence (s) for expression. 

Nucleic acid may be wholly or partially synthetic and may 

include genomic DNA, cDNA or RNA. Where nucleic acid according 
20 to the invention includes RNA, reference to the sequence shown 

should be construed as reference to the RNA equivalent, with U 

substituted for T. 

The present invention further provides a vector comprising 
a nucleic acid according to the present invention. The vector 
25 is preferably an expression vector comprising a nucleic acid 

encoding a polypeptide of a polyketide biosynthetic gene cluster 
(preferably a borrelidin biosynthetic gene cluster) , or a 
portion thereof, as described. Suitable vectors comprising 
nucleic acid for introduction into bacteria or eukaryotic host 
30 cells can be chosen or constructed, containing appropriate 

regulatory sequences, including promoter sequences, terminator 
fragments, enhancer sequences, marker genes and other sequences 
as appropriate. Vectors may be plasmids, viral eg "phage", or 
"phagemid", as appropriate. For further details see, for 
35 example, Sambrook et al., 1989. Many known techniques and 
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protocols for manipulation of nucleic acid, for example in 
preparation of nucleic acid constructs, mutagenesis, sequencing, 
introduction of DNA into cells and gene expression, and analysis 
of proteins, are described in detail in Short Protocols in 
Molecular Biology, Second Edition, Ausubel et al. Eds, John 
Wiley & Sons 1992. The disclosures of Sambrook et al. and 
Ausubel et al. are incorporated herein by reference. 

In another of its aspects the present invention provides 
an isolated polypeptide encoded by a nucleic acid molecule of 
the invention as described herein. More particularly, there is 
provided an isolated polypeptide comprising an amino acid 
sequence as shown in any one or more of SEQ ID Nos.2 to 44 or a 
portion thereof. As set out above, these amino acid sequences 
represent translations of the longest possible open reading 
frames present in the sequence of SEQ ID NO: 1 and the 
complement thereof. The first amino acid is always shown as 
Met, regardless of whether the initiation codon is ATG, GTG, CTG. 
or TTG. 

A polypeptide which is an amino acid sequence variant, 
allele, derivative or mutant of any one of the amino acid 
sequences shown may exhibit at least 60%, 65%, 70%, 75%, 80%, 
85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity 
with the polypeptide of any one of the SEQ ID Nos.2 to 44, or 
with a portion thereof. Particular amino acid sequence variants 
may differ from those shown by insertion, addition, substitution 
or deletion of 1 amino acid, 2, 3, 4, 5-10, 10-20 20-30, 30-50, 
50-100, 100-150, or more than 150 amino acids. Percentage 
identity may be calculated using one of the programs such as 
FASTA or BestFit from within the Genetics Computer Group (GCG) 
Version 10 software package available from the University of 
Wisconsin, using default parameters. 

The present invention also includes active portions, 
fragments, and derivatives of the polypeptides of the invention. 

An "active portion" means a peptide which is less than the 
full length polypeptide, but which retains at least some of its 
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essential biological activity. For example, isolated domains or 
modules of the PKS as described above may be regarded as active 

portions of the PKS 

A "fragment" means a stretch of amino acid residues of at 

5 least about five to seven contiguous amino acids, often at least 
about seven to nine contiguous amino acids, typically at least 
about nine to 13 contiguous amino acids and, most preferably, at 
least about 20 to 30 or more contiguous amino acids. Fragments 
of the sequence may comprise antigenic determinants or epitopes 

10 useful for raising antibodies to a portion of the relevant 

polypeptide. Thus the polypeptide need not comprise a complete 
sequence provided in any one of SEQ ID Nos 2 to 44, but may 
comprise a portion thereof having the desired activity, e.g. an 
isolated domain or module, such as those of the PKS described 

15 above. 

A "derivative" of a polypeptide of the invention or a 
fragment thereof means a polypeptide modified by varying the 
amino acid sequence of the protein, e.g. by manipulation of the 
nucleic acid encoding the protein or by altering the protein 

20 itself. Such derivatives of the natural amino acid sequence may 
involve insertion, addition, deletion or substitution of one, 
two, three, five or more amino acids, without fundamentally 
altering the essential activity of the wild type polypeptide. 
Polypeptides of the invention are provided in isolated 

25 form, e.g. isolated from one or more components with which they 
are normally found associated in nature. They may be isolated 
from a host in which they are naturally expressed, or may be 

* 

synthetic or recombinant. 

The present invention also encompasses a method of making 

30 a polypeptide (as disclosed) , the method including expression 
from nucleic acid encoding the polypeptide (generally nucleic 
acid according to the invention) . This may conveniently be 
achieved by growing a host cell in culture, containing an 
expression vector as described above, under appropriate 

35 conditions which cause or allow expression of the polypeptide. 
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Polypeptides may also be expressed in in vitro systems, such as 
reticulocyte lysate systems. 

The method may include the step of introducing the nucleic 
acid into a host cell. The introduction, which may 
(particularly for in vitro introduction) be generally referred 
to without limitation as "transformation", may employ any 
available technique. For eukaryotic cells, suitable techniques 
may include calcium phosphate transf ection, DEAE-Dextran, 
electroporation, liposome-mediated transfection and transduction 
using retrovirus or other virus, e.g. vaccinia or, for insect 
cells, baculovirus. For bacterial cells, suitable techniques 
may include calcium chloride transformation, conjugation, 
electroporation and transfection using bacteriophage. As an 
alternative, direct injection of the nucleic acid could be 
employed. Marker genes such as antibiotic resistance or 
sensitivity genes may be used in identifying clones containing 
nucleic acid of interest, as is well known in the art. 

A polypeptide, peptide fragment, allele, mutant or variant 
according to the present invention may be used as an immunogen 
or otherwise in obtaining . specif ic antibodies, which may be 
useful in purification and other manipulation of polypeptides 
and peptides, screening or other applications. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 illustrates the structure of borrelidin and some 
related metabolites isolated from borrelidin producing 
organisms . 

Figure 2 illustrates the incorporation patterns for 13 C stable 
isotope labelled extension substrates and the position of the 
trans- cyclopentane-1, 2-dicarboxylic acid starter unit derived 
carbons . 

Figure 3 illustrates the organisation of the borrelidin 
biosynthetic gene cluster. Restriction sites: B, BamHI; Be, 
Bell; E, EcoRI; X, Xhol . 



31 



Figure 4 illustrates a scheme showing the proposed biosynthetic 
pathway for the trans-cyclopentane-1, 2-dicarboxylic acid starter 

unit. 

Figure 5 illustrates the organisation of the borrelidin PKS and 
5 the biosynthesis of the pre-borrelidin molecule. 

Figure 6 illustrates the proposed biosynthetic route for the 
introduction of the nitrile moiety at the C12 position of 
borrelidin. 

Figure 7 illustrates the proposed structure of the molecule 6. 
.0 Figure 8 illustrates the proposed structure of the molecules 7 & 
8. 

Figure 9 illustrates the molecular characterisation of the 4- 
hydroxyphenylacetic acid catabolic pathway in E. colx W. 



15 DETAILED DESCRIPTION OF THE INVENTION 

A cosmid library of S. parvulus Tu4055 genomic DNA was 

♦ 

constructed using fragments obtained from a partial digestion 
with Sau3AI that were cloned into pWE15 and introduced into E. 
coll cells using the Gigapack® III Gold Packaging Extract kit 
20 (Stratagene) - A library of 3000 E. coli transformants were 

screened for homology using a labelled probe that was generated 
using the DIG DNA Labelling and Detection Kit (Roche) . The probe 
used was a 1.7 kbp Bglll-BamHI fragment obtained from the gene 
that encodes module 6 of the third subunit of the oleandomycin 
25 PKS from Streptomyces antibioticus (Swan et al. , 1994). 

Clones that gave a positive response were selected and 
cosmid DNA isolated. Cosmid DNA was digested with BamHI and 
fragments less than 3 kbp in size were sub-cloned into pOJ260 
(Bierman et al., 1992). The plasmids were then used to transform 
30 S. parvulus Tu4055 protoplasts and resulting mutants were 

screened for the ability to produce borrelidin. Two mutants were 
identified as borrelidin non-producers, both of which were 
derived from plasmids that contained fragments of cosBor32A2 . 
These two fragments were of 1.97 and 2.80 kbp in size, and were 
35 later identified as adjacent fragments encoding parts of the 
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borrelidin PKS (borA2 & borA3) . Using cosBor32A2 as the probe, a 
second overlapping cosmid, cosBorl9B9 was identified from the 
original library. These two cosmids are sufficient to cover the 
entire borrelidin biosynthetic gene cluster (see figure 3) . 
5 The complete nucleotide sequence of cosBor32A2 and 

cosBorl9B9 was determined by shotgun sequencing of a Sau3AI- 
derived subclone library for each cosmid, consisting of 1.5-2.0 

m 

kbp fragments in pHSG397 (Takeshita et al., 1987). Specific 
details are provided in example 3. The complete, overlapping 

10 nucleotide-coding sequence for cosBor32A2 and cosBorl9B9 is 
presented as SEQ ID No.l. The region encoded by cosmid 
cosBor32A2 represents the sequence from nucleotide positions 0- 
40217 bp of SEQ ID No.l. The region encoded by cosmid cosBor!9B9 
overlaps this region by 4452 nucleotides, and corresponds to the 

15 nucleotide positions 35766-74787 bp of SEQ ID No.l. As described 
in more detail in the following text, we have performed gene 
inactivation experiments on many of the orfs identified to be 
encoded within SEQ ID No.l, and this leads us to identify the 
limits of the cluster. The borrelidin biosynthetic gene cluster 

20 is most probably contained between nucleotide positions 7 603 to 
59966 of SEQ ID No.l (JborB to JborO, which includes the JborA 
region) . Thus, these combined efforts have led us to the 
identification and sequencing of the DNA region encompassing the 
entire borrelidin biosynthetic gene cluster, and to the 

25 identification and description of the functional sequences 
encoded within this region. 

PKS GENES 

Encoded between positions 16184-50742 of SEQ ID No.l are 6 
30 orfs that display very high homology to the genes' that encode 

the PKSs of known macrolide producing organisms. These genes are 
designated borAl, borA2, borA3, borA4, borAS and borAS, and 
encode the borrelidin PKS as was demonstrated above by 
disruption of a 1.97 kbp region within borA2. The six orfs are 
35 arranged in a head-to-tail manner and each is terminated by an 
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in-frame stop codon. The nucleotide sequence and corresponding 
polypeptide sequence details are shown below in Table 1: 
Table 1 




borAl 
borA2 
borA3 



borA4 
borA5 
borA6 



ID No.l 



16184-18814 
18875-23590 
23686-34188 



34185-39047 
39122-45514 
45514-50742 



nu 



SEQ ID No. 2 
SEQ ID No. 3 
SEQ ID No. 4 



SEQ ID No. 5 
SEQ ID No. 6 



SEQ ID No. 7 



5 The gene borAl encodes the starter or loading module (SEQ 

ID No.l, position 16184-18814). The assignment of the start 
codon is not obvious for this open reading frame. The start 
codon given here is the first candidate in the correct frame, 
but there are at least another three possible start codons 
10 between the first and the beginning of the ATO domain sequence. 
The start codon given here leaves a significant N-terminal tail 
of 321 amino acids preceding the ATO domain. For comparison the 
N-terminal tail preceding the ATO of the erythromycin loading 
module is 108 amino acids and that of the avermectin loading 
15 module is 28 amino acids. It is therefore possible that one of 
the other candidate start codons could be correct; the most 
likely of these are at positions 16298, 16607 and 16901 of SEQ 
ID No.l. The length of the N-terminal tail suggests it could 
possibly represent a catalytic activity, although it does not 
20 have any significant homology to other sequences in the 
databases. The nucleotide sequence position and the 
corresponding amino acid sequence for each of the functional 
domains within the starter module are identified below in Table 

2: 
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Table 2 



Domain In borAl 


Bases in SEQ XD No.l 


Amino acids in SEQ ID 
No. 2 


ATO 


17147-18175 


322-664 


ACPO 


18263-18472 


694-763 



The gene borA2 encodes the first extension module (SEQ ID 
No.l, position 18875-23590). The nucleotide sequence position 
5 and the corresponding amino acid sequence for each of the 
functional domains within the first extension module are 
identified below in Table 3: 
Table 3 



Domain in bojrA2 


Bases in SEQ XD No.l 


Amino acids in SEQ ID 
No. 3 


KS1 


18974-20251 


34-459 


ATI 


20543-21529 


557-885 


KR1 


22280-23011 


1136-1379 


ACPI 


23129-23332 


1419-1486 



10 The gene borA3 encodes the second and third extension 

modules (SEQ ID No.l, position 23686-34188). The nucleotide 
sequence position and the corresponding amino acid sequence for 
each of the functional domains within the second and third 
extension modules are identified below in Table 4: 

15 Table 4 



Domain in bojcA3 


Bases in SEQ ID No.l 


Amino acids in SEQ ID 
No. 4 


KS2 


23785-25062 


34-459 


AT 2 


25360-26346 


559-887 


DH2 


26392-26835 


903-1050 


KR2 


27745-28476 


1354-1597 


ACP2 


28567-28767 


1628-1694 


KS3 

i 


28855-30132 


1724-2149 


AT 3 


30418-31413 


2245-2576 
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DH3 " 


31462-31887 


KR3 


32863-33606 



10 



2593-2734 
3060-3307 



15 



3340-3406 



The gene borA4 encodes the fourth extension module (SEQ ID 
No.l, position 34185-39047). The nucleotide sequence position 
and the corresponding amino acid sequence for each of the 
functional domains within the fourth extension module are 
identified below in Table 5: 
Table 5 

Domain in boxA4 

No . 5 




KS4 


34284-35561 


34-459 


AT 4 


35847-36842 


555-886 


KR4 


37719-38453 


1179-1423 


ACP4 


38559-38759 


1459-1525 



The gene borAS encodes the fifth extension module (SEQ ID 
No.l, position 39122-45514)- The nucleotide sequence position 
and the corresponding amino acid sequence for each of the 
functional domains within the fifth extension module are 
identified below in Table 6: 
Table 6 

in SEQ ID No.l 




KS5 



ATS 



DH5 



ACP5 



39221-40492 



40778-41785 



41834-42259 




43322-44191 
44207-44947 
45044-45244 



34-457 



553-888 



905-1046 



1401-1690 



1696-1942 
1975-2041 



The gene borA6 encodes the sixth extension module and the 
chain terminating thioesterase (SEQ ID No.l, position 45514- 
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50742) . The nucleotide sequence position and the corresponding 
amino acid sequence for each of the functional domains within 
the sixth extension module are identified below in Table 7: 
Table 7 



Domain in borA6 


Bases in SEQ ID No.l 


Amino acids in SEQ ID 
No. 7 


KS6 


45622-46884 


37-457 


AT6 


47176-48162 


555-883 


KR6 


48814-49518 


1101-1335 


ACP6 


49624-49824 


1371-1437 


TE 


49894-50637 


1461-1708 



5 

The identification of functional domains and their 
boundaries as described in the aforementioned are determined 
based on the similarities to the conserved amino acid sequences 
of other modular PKSs such as those for the rapamycin (Schwecke 

10. . et al., 1995; Aparicio et al., 1996) and erythromycin (Cortes et 
al., 1990) biosynthesis. The limits of the catalytic domains are 
established on the basis of homology to other PKS clusters and 
the chosen point at which a domain starts or finishes is not 
absolutely defined, but selected based on the aforementioned 

15 considerations. In the case of the (3-keto processing domains it 
is least obvious, as there is typically a large region not 
assigned to a functional domain that precedes the KR domain. 
This region may be structurally important, or required for 
stability of the PKS dimer. An unusual characteristic of the 

20 borrelidin PKS is that all of the individual enzymatic domains 
appear to be catalytically competent based on their 
oligonucleotide/amino acid sequence, and are all necessary in 
order to provide the p-keto processing required to produce the 
functional groups observed in borrelidin. This is rather unusual 

25 as the majority of modular PKS sequences so far reported contain 
one or more inactive domains, an exception being for example the 
spinosyn PKS (Waldron et al., 2001; US 6,274,50 Bl) . 
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The loading module of the borrelidin PKS exists as a 
discrete protein. This is rather unusual as the majority of 
loading modules are found on the same protein as the first 
extension module. Exceptions to this include, for example, the 
nystatin (Brautaset et al., 2000) and amphotericin (Caffrey et 
al., 2001) PKSs. The loading module, which consists of an AT-ACP 
didomain, is similar to the broad specificity loading modules of 
the avermectin and the erythromycin PKSs, which accept a number 
of alternative starter acids, and are of use in generating 
libraries of novel polyketides (Marsden et al., 1998; Pacey et 
al., 1998) . The AT domain of the borrelidin PKS loading module 
diverges from the vast majority of AT domains as the active site 
serine residue is replaced with a cysteine such that the active 
site motif is GXCXG (specifically GHCYG) . In most available 
type I PKS AT domain sequences, the conserved active site motif 
is GXSXG; the same motif is observed in lipases, fatty acid 
synthases and most thioesterases . The nucleophilic serine is 
substituted by cysteine in two NRPS thioesterase domains, 
specifically the synthetases responsible for the production of 
mycobactin and pyochelin (Shaw-Reid et al., 1999). A GXCXG motif 
is also observed in a thioesterase-like domain of ORFl in the 
bialaphos cluster (Raibaud et al., 1991). It has been suggested 
that since it is not possible to move between the two types of 
serine codons by a single base change, active sites containing 
an essential serine residue may lie on two lines of descent from 
an ancient ancestral enzyme that had a cysteine instead of a 
serine in its active site (Brenner, 1988). The- presence of 
enzymes containing cysteine in the active site may support this 
view. It may alternatively be the case that cysteine arises in 
these active sites because it is possible to move from one type 
of serine codon to the other via a cysteine which would remain 
active . 

The AT domains of PKSs select a particular carboxylic acid 
unit as substrate. This selectivity has been shown to correlate 
with certain motif signatures within the AT domain (Reeves et 
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al., 2001; WO 02/14482) . The borrelidin loading module AT domain 
motif differs from any described so far, which is not surprising 
as this AT domain is the first to be sequenced that selects an 
alicyclic dicarboxylic acid- The AT domains for the borrelidin 
5 . PKS extension modules display the expected active site motif 
GXSXG, and also each contain the expected motifs for the 
selection of malonyl-CoA or methylmalonyl-CoA (Reeves et al., 
2001; WO 02/14482) . The malonyl-CoA selective AT domains (ATI, 
AT2 and AT6) show very high similarity to one another, both at 

10 the protein and the DNA level. The same is true for the 

methylmalonyl-CoA selective AT domains (AT3, AT 4 and ATS) ; two 
of these AT domains (AT3 and AT4) have identical amino acid 
sequences throughout the conserved region. The high similarity 
of ATS to AT3 and AT 4 is evidence that the extender unit 

15 selected in module 5 is methylmalonyl-CoA, and that the 

borrelidin C12-methyl group thus incorporated is subsequently 
modified to a nitrile function after incorporation into the PKS, 
To demonstrate that we can alter the PKS derived structure 
of borrelidin, the AT domain of module 4 (the AT domain encoded 

20 by borA4) was replaced by the AT domain of module 2 of the 

rapamycin PKS (rapAT2) using a replacement strategy (see example 
6). This gave strain S. parvulus Tii4055/467. Upon fermentation 
and LCMS analysis of culture extracts of this mutant, it was 
determined that some borrelidin was produced and a new, more 

25 polar compound was observed which had a m/z value 14 units lower 
than borrelidin. This is consistent with incorporation of a 
malonate rather that a methylmalonate extender unit by module 4 
of the PKS to produce 10-desmethyl borrelidin 3. 

In addition to production by domain swapping methods, 3 

30 was also generated by introducing specific mutations, into the 
module 4 AT domain selectivity motif (Reeves et al., 2001; WO 
02/14482) (see example 7) . Such a change affects the selectivity 
of the AT domain such that it selects a substrate molecule of 
malonyl-CoA preferentially over methylmalonyl-CoA. Thus, the 

35 amino acid motif YASH at positions 739 to 742 of SEQ ID No. 5 was 
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mutated to HAFH to give strain S. parvulus TU4055/472. Upon 
fermentation and LCMS analysis of culture extracts of this 
mutant it was determined that borrelidin was produced in 
addition to a new, more polar compound with a m/z value 14 units 
lower than borrelidin. This new compound is identical to that 
described above and thus is consistent with incorporation of a 
malonate rather that a methylmalonate extender unit by module 4 
of the PKS to produce 3. 

These results clearly indicate that the borrelidin PKS is 
amenable to genetic manipulation and to the exchange of native 
sequence for that of a heterologous strain. Thus biosynthetic 
engineering, by the methods described above, of the borrelidin 
PKS may be used to provide novel borrelidin-like molecules. 

The borrelidin loading module is of interest due to the 
unique structure of its cognate substrate. To examine its 
potential use in other systems, we replaced the loading module 
native to the erythromycin PKS with the borrelidin loading 
module; this experiment is analogous to those done previously 
with the avermectin loading module (WO 98/01546; Marsden et al. r 
1998) . We anticipated that the new strain would be capable of 
producing novel erythromycin like molecules in which the CIS- 
ethyl group is replaced with an exogenous ly supplied trans- 
cyclopentane-l,2-dicarboxylic acid moiety. The methodology used 
to perform this experiment was similar to that described in WO 
98/01546, but the transformation was performed using a mutant 
Saccharopolyspora exythraea DM (Gaisser et al., 2000) which 
accumulates the aglycone product erythronolide B rather than the 
fully processed macrolide, as well as using S. erythraea WT. 
This experiment is described in example 8. 

It is not evident from SEQ ID No.l, which of four 
candidate start codons is correct for borAl. The four most 
obvious candidate start codons are at nucleotides 16184, 16298, 
16607 and 16901 of SEQ ID No.l. The earliest of these possible 
start codons is used in giving the amino acid sequence for SEQ 
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ID No .2. A pile-up of this loading module with the erythromycin 
and avermectin loading modules indicates that the ATO domain 
starts at position 321 of SEQ ID No. 2, and that there is a long 
N-terminal tail. No significant homology is found for the first 
298 amino acids of JborAl. The borrelidin loading module is 
encoded by a discrete orf , and in order to retain this 
architecture the splice site chosen for joining the borrelidin 
PKS loading module sequence to the erythromycin PKS loading 
module sequence is at the beginning of the homologous region of 
the KS1 domain of borA2, at amino acids 42-44 of SEQ ID No. 3. 
This approach maintains the putative docking regions at the end 
of BorAl and start of BorA2 that are believed to be essential 

* 

for the production of a functional PKS assembly. To maintain the 
continuity of this experiment this loading module was fused to 
the equivalent point at the beginning of the KS1 domain of 
eryAI. The resulting mutants S. erythraea DM/CJM400-403 were 
fermented and analysed by negative ion LCMS using standard 
protocols . This analysis clearly indicated the presence of a new 
compound 6 with m/z = 485.3 as expected (figure 7). It therefore 
appears that the borrelidin loading module has utility for the 
biosynthetic engineering of other PKSs (i.e. not the borrelidin 
PKS) to produce further novel polyketides bearing a trans- 
cyclopentane-1, 2-dicarboxylic acid moiety. It is also clear that 
the diversity of products arising from hybrid PKSs derived from 
the borrelidin loading module may be further enhanced through 
the exogenous feeding of carboxylic acids other than the cognate 
substrate . 

The most striking feature of the borrelidin PKS is the 
clear divergence from the normal co-linear, processive mode of 
operation for type-I modular PKSs. Borrelidin is a nonaketide 
(one loading plus eight extension steps), but only seven modules 
(one loading and six extension modules) are present in the 
cluster. Analysis of the PKS domains with respect to the 
chemical structure of borrelidin correlates with the fifth 
extension module (BorA5) being used iteratively . f or three rounds 
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of chain elongation as shown in figure 5. Thus, the fifth f sixth 
and seventh rounds of chain elongation occur on BorAS with the 
incorporation of three methylmalonyl-CoA extension units , and 
with full reductive processing of the D-keto groups to methylene 
moieties. As described in the aforementioned, the divergence 
from co-linear operation for modular PKSs is unusual and limited 
to a few examples. The present example is interesting as it 
occurs on a module that reduces the p-keto group fully to a 
methylene moiety and which is followed by an inter- rather than 
intra-protein transfer of the growing chain. This is also the 
case for the two known examples of erroneous iterative use of 
type-I modules by the erythromycin (Wilkinson et al., 2000) and 
epothilone (Hardt et al., 2001) PKSs. It is noteworthy that this 
full reduction makes these modules functionally equivalent to 
15 fatty acid synthase (FAS) . The type-I PKS modules that can 
operate iteratively may have retained FAS like activity. 

Although it appears that BorA5 is used iteratively (three 
times), two other possible scenarios may explain borrelidin 
biosynthesis given the genes present in the borrelidin 
20 biosynthetic cluster. Firstly, two modules may be ^missing' from 
the cluster, but could be present at some other location in the 
genome. However, in the majority of cases investigated, the 
genes required for biosynthesis of secondary metabolites in 
actinomycetes are clustered in a single locus. The second 
25 possibility is that three separate BorAS dimers assemble, and 
that each catalyses a round of chain elongation; thus the 
process would be processive. However, this scenario requires 
that three times the amount of BorAS is produced with respect to 
the other PKS proteins, but the organisation of the borrelidin 
30 gene cluster does not indicate that the regulation of borA5 

differs from that of any of the other PKS genes. In addition, 
this scenario does not fit with the common thinking as to the 
roles of inter-protein linkers, which suggests that there is a 
specific recognition between the N- and Oterminal ends of the 
35 proteins of the biosynthetic complex that need to interact, 
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enabling specific binding between modules encoded on different 
proteins (Ranganathan et al., 1999; Wu et al., 2001). 

To address the issues described above, the two proteins 
encoded by borA4 and borA5 were fused after manipulation at the 

.5 genetic level to provide strain S. parvulus Tu4055/JborA4-A5 (see 
example 9) , and separately the two proteins encoded by borA5 and 
borA6 were fused in an analogous manner to provide strain S. 
parvulus Tii4055/JborA5-A6 (see example 10) . Therefore, the new, 
fused, bi-modular genes make it impossible to assemble three 

10 separate molecules of BorAS, or for another protein (s) encoded 

by a gene(s) remote from the borrelidin cluster to act in tandem 
with BorAS. Upon fermentation of strains S. parvulus 
Tii4055/JborA4-A5 & /borA5-A6, followed by extraction and 
analysis, the production of borrelidin was verified at a reduced 

15 but significant level (25±4% and 26±5% respectively) when 

compared to the WT strain. Thus, the production of borrelidin by 
these mutants indicates that module 5 of the fused BorA4-A5 or 
BorA5-A6 operates in an iterative manner. . In addition, a mutant 
was formed in which both of these fusions were present, S. 

20 parvulus TH4055 /borA4-A5-A6, and after fermentation, extraction 
and analysis, the production of borrelidin was verified at 18±5% 
when compared to the WT strain. 

The ability of BorA5 to operate iteratively has great 
potential for the engineering of heterologous PKSs to provide 

25 macrolactones with expanded ring sizes. To examine this 

possibility we swapped BorA5 into the erythromycin PKS in place 
of module 4 of DEBS2 . This was done by replacement of the 
appropriate gene fragment in both the erythromycin producer 5. 
erythraea WT and S. erythraea DM. This experiment was chosen as 

30 both modules recruit methylmalonyl-CoA extender units and 

process the p-keto functions formed through to methylene groups. 
In addition, the stereochemistry of the resulting methyl group 
in the polyketide chain is the same in both cases . Of most 
significance is the fact that module 4 of DEBS2 is known to 
35 perform erroneous iterative rounds of chain elongation 
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(Wilkinson et al., 2000), indicating that such a process can 
indeed occur at this location within the PKS and give rise to 
products that can be fully processed by DEBS3 , making it an 
attractive target to introduce specific iterative use of a 
5 heterologous module to make 16- and 18-membered macrolides. 

Briefly, the region of DNA encoding borA5 is swapped for 
that encoded by module 4 of eryA2, which encodes the C-terminal 
portion of DEBS2 of the erythromycin PKS (see example 11) . The 
resulting mutant S. erythraea DM/421 is grown and extracted as 
10 for the production of metabolites by S. erythraea strains 

(Wilkinson et al., 2000) and then analysed by LCMS. Two new 
significant compounds, which are less polar than erythronolide 
B, are observed. These had an m/z of 435.5 (7, [MNa + ] ) and 477.5 
(8, [MNa + J ) respectively, which is consistent with the 
15 production of two new ring expanded erythronolide B analogues 
(figure 8) . Compound 7 with m/z = 435.5 is consistent with the 
presence of the 16-membered ring-expanded erythronolide B 
related macrolide reported previously as a minor component of S. 
erythraea WT fermentations (Wilkinson et al. , 2000). The 
20 inclusion of such a module into other positions of the 

erythromycin PKS or into other PKSs may allow the production of 
novel, ring expanded polyketides in a similar manner. In 
addition, it is possible to perform this experiment by swapping 
only the region of the DEBS module 4 from the start of the 
25 conserved region of the KS4 to the end of the ACP4 domain; this 
arrangement retains the C- and- N-terminal regions at the end of 
DEBS2 and DEB S3 respectively, to ensure the mutual recognition 
and docking of these proteins . 



30 NON-PKS GENES 

Both upstream and downstream of the PKS encoding genes are 
other orfs involved in the biosynthesis of borrelidin. An orf is 
designated as consisting of at least 100 contiguous nucleotides, 
that begins with an appropriate start codon and finishes with an 

35 appropriate stop codon, and which has an appropriate codon bias 
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for protein-coding regions of an organism whose DNA is rich in 
the nucleotides guanine and cytosine. In the DNA sequence both 
upstream and downstream of the borrelidin PKS genes (JborAl- 
borA6) there are a number of orfs that could be identified by • 
comparison to other sequences in the NCBI database (see figure 
3) . The nucleotide sequence details of these orfs are given 



below in Table 8: 
Table 8 



1 f^. *"» <~» 


Bases in SEO ID No - 1 

- 


Corresponding 
polypeptide sequence 
number 1 


I ho r*R 


7603-8397c 


SEQ ID No. 8 "1 


1 ho rC 


8397-9194c 


SEQ ID No. 9 


1 ho t~7~) 


9244-9996c 


SEQ ID No. 10 \ 


1 ho rK 


9993-11165c 


SEQ ID No. 11 


1 hnrF 1 


11162-11980c 


SEQ ID No. 12 


1 borG \ 


11992-13611c 


SEQ ID No. 13 


borH 

1 +~/\mf mlm XX 1 


13608-15659c" 


SEQ ID No. 14 


JborJ 


50739*-52019 


•SEQ ID No. 15 


borJ 


52113-53477 


SEQ ID No. 16 


borK 


53486-54466 


SEQ ID No. 17 


borL 


54506-56176 


SEQ ID No. 18 


borM 


56181"-57098 


SEQ ID No. 19 


borN 


57112-57858 


SEQ ID No. 20 


borO 


57939-59966 


SEQ ID No. 21 


orfBl 


2-313 


SEQ ID No. 22 


orfB2 


501*-3107 


SEQ ID No. 23 


orfB3 


3172-3810c 


SEQ ID No. 24 


orfB4 


3935-4924C 


SEQ ID No. 25 


orfBS 


5123-5953 


SEQ ID No. 26 


orfB6 


5961-6518*0 


SEQ ID No. 27 


orfB7 


6564'-7538 


SEQ ID No. 28 


orfB8 


60153-60533c 


SEQ ID No. 29 



* 
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orfB9 
orfBl 0 



60620-61003 
61188~-61436 



SEQ ID No. 30 
SEQ ID No -31 



orfBll 
orfBl2 
orfBl3a 



61526-61738 
61767-62285c 
62750-63067c 



SEQ ID No. 32 
SEQ ID No. 33 
SEQ ID No. 34 



orfBl3b 



orfBl4 



62586-62858C 
63155-65071c 



SEQ ID No. 35 
SEQ ID No. 36 



orfBl 5 
orfBl 6 
orfBl 7 
orfBl 8 



65374-65871 
65942-68305c 
68290-68910C 
69681-70436 



SEQ ID No. 37 
SEQ ID No. 38 
SEQ ID No. 39 



SEQ ID No. 40 



orfBl 9 
orfB20 
orfB21 



orfB22 



70445-71848 
71851-72957 
73037-73942 
73995-74534C 



SEQ ID -No. 41 
SEQ ID No. 42 
SEQ ID No. 43 



SEQ ID No. 44 



[Note 1: c indicates that the gene is encoded by the complement 
DNA strand; Note 2: for each open reading frame given above, the 
longest probable open reading frame is described. It is 
sometimes the case that more than one potential candidate start 
codon can been identified. One skilled in the art will recognise 
this and be able to identify alternative possible start codons. 
We have indicated those genes which have more than one possible 

start codon with a symbol.] 

Potential functions of the predicted polypeptides (SEQ ID 
No. 7 to 43) were obtained from the NCBI database using a BLAST 
search. The best matches obtained from these searches are 
described below in Table 9: 



Table 9 



Gene 


Significant protein 
match 


Score 


Accession 
GenBank 


Proposed 
function 


orjfBl 


hypothetical protein, 
no full length hits, 
high GC codon 
preference 






unknown 



* 



46 



1 or£B2 1 


SCM2.07, hypothetical] 
protein (S. 
coelicolor) 


998 1 


NP_625154 


unknown I 


or£B3 \ 


SCF7 6 • u / / nypotne ticai i 
protein, (S. 1 
coelicolor) 


OCQ I 


vitr O^fi / O D 


vin.Kn.owri i 


orfB4 


SCF7 6.06, araC family! 
transcriptional 
regulator (S. 
coelicolor) 


412 


NP b<£4 /oi) 


un. Known 


orfBS 


SCF76.05c, non-heme 1 
chloroperoxidase (S. 
coelicolor) 


495 


NP_624784 


non-heme 

chloroperoxidase 


orfB6 


SCF76.09, hypothetical 
protein (S. S 
coelicolor) 


159 


NP_624788 


unknown 


orfBl 


SCF76.08c, 

hypothetical protein 
(S. coelicolor) 


473 


i NP_624787 


unknown 


borB 


PteH, polyene 
macrolide thioesterase 
1 (S. avermitilis) 


244 


BAB69315 


type II 
thioesterase 


borC 


XF1726, 2, 5-dichloro- 
2, 5-cyclohexadiene- 
1,4,-diol 
[ aenyarogenase \Ayieixa 
fastidiosa strain 
9a5c) e 


160 


NP_299015 

l 


dehydrogenase 


borD 


FabG, 3-oxoacyl-ACP 
reductase precursor, 
{Plasmodium 
falciparum) 


124 


AAK83686 


3-oxoacyl-ACP 
reductase 
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1 borM I 


SCL6 . 10 , hypothetical I 
protein siiailar to 
putative F420- 
dependent 

dehydrogenase (5. 

COeJLJLCOJLOJ:) f 1 


108 1 


CAB76875 j 


F420 dependent 1 
dehydrogenase 

t 

* 1 


borN 


SC1C2 . 27 , hypothetical 
protein, 2- 
nyaroxynepta ^ / f* 
diene-1, 7-dioate 
isomerase superfamily 
(S. coelicolor) 


215 


NP_629680 


2-hydroxyhepta- 
2 , 4-diene-l , 7- 


borO 


ThrS, threonyl-tRNA 
synthetase 


627 


NP_301410 


threonyl-tRNA 
synthetase, self 


orfB8 


conserved hypothetical 1 
protein 

{Methanosarcina 

i ~4 4 m *f T T"/**% T>"» ^5 •*» ^ « V f" O 7\ I 

j acecivoxaiis st-r. . 
(Pfam pulls out weak 
MarR family) 


37 


NP_617908 

1 * 


possible 
regulator 




j puuauive auLi sigma 
factor antagonist 
{Streptomyces 

J coelicolor) 


no 


NP 631789 


1 anti-sioma factor 8 
antagonist 


orfBl 


1 conserved hypothetical 
protein (S. 
coelicolor) 


95 


NP_631790 


unknown 


orJfBI 
I 


! hypothetical protein, 
no full length hits, 
high GC codon 
preference 






unknown 
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orfBl | putative regulator (S. I 92 
2 ! coelicolor) 



orfBl 
3a 



orfBl 
3b 



orfBl 
4 



NP 631494 



regulator (of a 
two component 
system, maybe 
membrane sensor) 



putative 
acetyltrans 
coelicolor) ; 
putative 
acetyltrans 
coelicolor) 



58 



NP 625155 tentative 



ferase (S. 



100 



NP 625155 



ferase (S. 



assignment of 
acetyltransf erase 
in two frames, or 
sequencing error 
and should be in 
a single frame 



putative lipoprotein 
(S. coelicolor) 



386 



NP 631245 unknown 



orfBl | hypothetical protein 
5 1 {S. coelicolor) 



41 



NP 631424 unknown 



orfBl j putative formate 
6 I dehydrognease (S. 

coelicolor) 
(Pfam matches to 
molybdopterin 
oxidoreductase/ 
formate dehydrogenase 
alpha subunit) 
orfBl | conserved hypothetical 
7 [ protein, S. coelicolor 

SCBAC25F8.16 



915 



NP 626265 oxidoreductase 



175 



NP 631569 unknown 



orfBl I product 

8 | (Streptojnyces 
aureofaciens) 

orfBl I putative 

9 I dehydrogenase 
aureofaciens) 



unknown 396 



AAD23399 unknown 



aldehyde 
(S. 



635 



AAD23400 | aldehyde 

dehydrogenase 



50 



orfB2 
0 


putative alcohol 
dehydrogenase (S. 
coelicolor) 


450 


NP_630527 


alcohol 
dehydrogenase 


orfB2 
1 


hypothetical protein 
(5. coelicolor) 


395 


NP_630528 


unknown 


orfB2 
2 


putative calcium 
binding protein (S. 
coelicolor) 


160 


NP_631687 


calcium binding 
protein 



Analysis of the functions of the putative gene products 
indicates that the genes borB to borO most probably form the 
boundaries of the borrelidin biosynthetic cluster- Evidence to 
5 support this came from the disruption of borB2, which produced 

* 

borrelidin at levels indistinguishable from the wild type 
parental strain. In addition, borB3 to borBl have homologues in 
the Streptomyces * coelicolor A3 (2) genome encoded on cosmid 
SCF76; the same orfs are present, but in a different order. The 
10 orfs borB8 to borBlO are arranged identically to homologues in 
the 5. coelicolor A3 (2) cosmid SC5E3. The orfs borB18 to borB21 
have homologues that are arranged similarly in the S. coelicolor 
A3 (2) cosmid SC1A2 . The orf borB13 contains a frame-shift and 

■ 

thus any gene product would most probably be inactive. In 
15 addition no function can be readily deduced for the products of 
these orfs during borrelidin biosynthesis . 

« 

Starter unit biosynthesis genes 

In order to identify the genes that are involved in the 

20 biosynthesis of the trans-cyclopentane w l, 2-dicarboxylic starter 
unit, each of the genes JborB to JborO was disrupted (eg. see 
examples 12-15) . This was done in a manner designed to minimise 
the possibility of polar effects, which was verified by 
successful in trans complementation with a full-length copy of 

25 the disrupted gene under the control of the ermE* promoter, 
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which gave back approximately WT levels of borrelidin production 
in each case. 

Each of the disrupted mutants was grown in triplicate as 
described in example 1, and borrelidin production assessed. 
Alongside these, each mutant was grown in triplicate and 
supplemented, after 24 hours, with exogenous starter acid to a 
final concentration of 1 mM, and borrelidin production assessed. 
Extraction and analysis for borrelidin provided the data that 
are described below in Table 10: 
Table . 10 



Borrelidin ! 
biosynthetic gene 1 
disrupted 

n 


Borrelidin production 
without feeding 
(%relative to WT) 

i 


Borrelidin productxon 
with feeding I 
(%relative to unfed 
WT) 


Wild type [ 
(control) J 


100116, (100±2) ! | 


363±65, (269149) 


JborB 


75111, (43±20) 


172±51 


JborC " 1 


0, (1013) 


933142 


borD 


7±1, (0) " 1 


75115 


borE 


2±1 


122123 


borF " 


3±2 " 1 


201152 


JborG 


1111, (32±3) " 


15321142 


Jbortf 


1712, (23±13) 


203140 


1 borl 


0, (0) 


0, (0) j 


\ borJ 


0, (0) 


0, (0) 1 


borK 


0, (6±1) 


319154, (464118) 


JborL 


" 0, (0) ' 


408170, (399169) 


1 borM . 


0, (6±3) 


461129, (55,3166) 


1 borN 


25+9, (34±3) 


68112, (4619) 


borO 


N/A 


N/A 



[Note 1: The values given in brackets indicate where repeat runs 
of some experiments were performed; Note 2: N/A = not 
applicable. ] 
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Based on the data in table 10, it is clear to one skilled 
in the art that the gene products BorC-F and K-M are essential 
or very important for the biosynthesis of trans-cyclopentane- 
1,2-dicarboxylic acid, as these mutants produced no or very low 
levels of borrelidin without the addition of exogenous starter 
acid, whereupon they produced borrelidin at levels approaching, 
or better than, that of the WT organism. In addition the gene 
products BorG, H, and N appear to be involved in, but not 
essential for, the biosynthesis of the starter unit, as they 
produced significantly lower levels of borrelidin unless the 
addition of exogenous starter acid, whereupon they produced 
borrelidin at levels approaching or better than that of the WT 
organism. 

The normal metabolic function of BorN homologues is the 
production of 2-oxohepta-3-ene-l, 7-dioate 10, a key step in the 
catabolism of tyrosine via 4-hydroxyphenyl acetic acid 9 (figure 
9) (Prieto et al. t 1996). Therefore, 10 may be an intermediate 
in the biosynthetic pathway to trans-cyclopentane-1, 2- 
dicarboxylic acid. The ability of the mutant disrupted in JborN 
to produce borrelidin, albeit at a reduced level, most probably 
lies in the presence of a homologue elsewhere in the genome 
utilised in the catabolism of tyrosine during primary 
metabolism. 

The intermediate 10 contains all the required 
functionality for the eventual formation of trans-cyclopentane- 
1, 2-dicarboxylic acid. The most probable, next step of the 
biosynthesis is the reduction of the 3-ene position in a 
reaction similar to that catalysed by an enoyl reductase. 
Potential enzymes responsible for this step are BorC, BorD, BorK 
or BorM; these enzymes are all involved in borrelidin starter 
unit biosynthesis as seen from the data in table 10. The 
resulting 2-oxohepta-l, 7-dioate 11 is one possible substrate for 
cyclisation through formation of a new C-C bond between C6 and 
C2. Another possible substrate for this cyclisation would be 2- 
hydroxyhepta-1, 7-dioate 12 or some activated form thereof. This 
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would presumably be formed from 11 by the action of an 
oxidoreductase such as BorC, BorD or BorM. 

The key cyclisation step is most probably catalysed by 
BorE, which displays similarity to O-succinylbenzoyl-CoA 
synthase and chloromuconate cycloisomerase. These enzymes belong 
to the enolase super-f amily, the members of which share the 
common ability to stabilise the formation of an anion on the 
carbon atom adjacent to a carboxylate group (Schmidt, et al., 
2001) . It is further notable that the substrate for muconate 
cycloisomerase is a hexa-1, 6-dioate, which is similar in gross 
structure to 11 and 12. Abstraction of a proton and formation of 
an anion at C6 of 11 or 12 (or an activated form thereof , eg. 
13) with subsequent cyclisation to C2 provides the correctly 
substituted cyclopentane ring structure, although the 
intermediacy of 11 as substrate would require some further 
processing of the substituted cyclopentane, most probably via 
elimination of water to give the C2-symmetric cyclopent-l-ene- 
1, 2-dicarboxylic acid. However, the feeding of cyclopent-l-ene- 
1, 2-dicarboxylic acid, or ethyl esters thereof, to S. parvulus 
TU4055 strains disrupted in any of borC-E, or to WT strains, did 
- not produce any borrelidin, or did not produce borrelidin in any 
increased amount when compared to the unfed controls. These data 
indicate that this compound is probably not an intermediate in 
starter unit biosynthesis, and that the substrate of BorE is 
most probably the 2-hydroxyhepta-l, 7-dioate 12, or an activated 
form thereof (eg. 13) . A putative pathway for the biosynthetic 
pathway to trans-cyclopentane-1, 2-dicarboxylic acid is shown in 
figure 4 . 

The combined, specific genes required for the biosynthetic 
steps to trans-cyclopentane-1, 2-dicarboxylic acid are not clear, 
but probably are encoded by some combination of borC-H, borK, 
borM and borN. The lack of certain homologues of genes that are 
involved in the catabolism of 4-hydroxyphenyl acetic acid 9, and 
which would act prior to BorN in the pathway, is most probably 
an indication that primary metabolic genes perform these tasks. 

■ 
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The addition of exogenous trans-cylopentan£-l, 2-dicarboxylic 
acid to S. parvulus Tu4055 and related strains increases the 
titre of borrelidin in the order of 2- to 3-fold under our 
conditions, indicating that the biosynthesis of starter acid is 
5 a limiting factor in borrelidin biosynthesis. These data are 

consistent, with primary metabolic degradation of tyrosine . being 
the source of trans-cyclopentane-1, 2-dicarboxylic acid. 

In an attempt to further clarify which genes may be 
specifically responsible for biosynthesis of the starter unit, a 

10 number of co-culture experiments were performed with 

combinations of the different mutants - these require the 
knowledge that the gene products of borl and borJ are 
specifically involved in the formation of the Cl2-nitrile 
moiety, which is clarified by the data given in the following 

15 section below in combination with the data from table 10. In 

summary, the co-culture of mutants borET & borD" , and of borET & 
JborftT failed to produce any borrelidin whereas the co-culture of 
mutants borM~ & borJT, and borlW & borlC produced borrelidin at 
approximately WT levels. These data in combination to that in 

20 table 10 and below clearly indicate that borD, borE and borM are 
involved in starter unit biosynthesis, whereas JborJ, and 
possibly borK f are involved in the formation of the nitrile 
moiety at C12 of borrelidin. 

It is clear from the data in table 10 that exogenous 

25 addition of trans-cyclopentane-1, 2-dicarboxylic acid is 

sufficient to re-establish approximately WT levels, or better, 
of borrelidin production in mutants where genes that are 

■ 

involved in starter unit biosynthesis have been disrupted. These 
data indicate that there is no problem with the active uptake of 

30 added carboxylic acid by S. parvulus Tu4055, and that an 
activity is present which is capable of converting the 
carboxylic acid to a CoA thioester equivalent. Thus, given the 
known technologies of mutasynthesis, it appears that the 
addition of exogenous carboxylic acids to one of the 

35 aforementioned mutants, for example the borE' strain S. parvulus 
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Ttt4055 /borE:aac3 (IV) described in example 12, may lead to the 
production of borrelidin analogues in which the starter unit 
carboxylic acid moiety is replaced with a moiety derived from 
the exogenously added carboxylic acid. 

Specifically, when trans-cyclobutane-1, 2 -di carboxylic acid 
was fed to the strain S.parvulus Tu4055/bor£: aac3 (IV) in a 
manner analagous to that described in example 1 a new compound 
was produced in good yield. This compound eluted earlier than 
borrelidin using the analytical method described in example 4 
with a UV spectrum identical with that of the borrelidin 
chromaphore (2w = 258 nm) . The mass spectrum of the new 
compound displayed a signal (m/z = 474.4 in negative ion mode) 
that is 14 amu less than Borrelidin A itself. These data are 
consistent with the production of a new borrelidin in which the 
cyclopentane carboxylic acid moiety attached to C17 is replaced 
with a cyclobutane carboxylic acid moiety. Additionally, the 
feeding of DL-2-methylsuccininc acid led to the production of a 
new compound that runs chromatographically close with Borrelidin 
A using the analytical method described in example 4, and with a 
UV spectrum identical with that of the borrelidin chromaphore 
(Xraax - 258 nm) . The mass spectrum of the new compound displayed 
a signal (m/z = 462.3 in negative ion mode) which is 26 amu less 
than Borrelidin A itself; thus, it would appear that the feeding 
of this acid also leads to the production of a new borrelidin 
compound in which the cyclopentane carboxylic acid moiety 
attached to C17 is replaced with a (C18 or C19) C-methyl 
* substituted propionic acid. This is the first borrelidin 
utilising an acyclic starter acid. 

In an attempt to improve the titre of borrelidin produced 
in fermentation cultures of S. parvulus Tu4055, additional 
copies of the genes borE and borL were introduced into the 
organism in vectors that place them under the control of the 
strong constitutive promoter ermE* . It was anticipated that the 
over-expression of these genes would increase the intra-cellular 
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levels of the starter acid, which appears to be limiting with 
respect to borrelidin production. 

The genes borE and Jbori were amplified by PCR, cloned into 
the vector pEM4, and then introduced into S. parvulus TU4055 as 
described in examples 16 and 17 respectively. In addition, the 
vector pEM4 alone (not containing any insert) was also 
introduced in S. parvulus T\i4055 and used as a control. The 
resulting strains were grown, extracted and analysed by HPLC. 
Introduction of the vector as a control did not significantly 
effect the levels of borrelidin production. However, the 
expression of additional copies of either borE or borL in this 
manner brought about a 4.2±0.3 and 4.3±0.7 fold increase 

■ 

respectively in the titre of borrelidin relative to the wild 
type strain. Presumably, the steps of biosynthesis catalysed by 
their gene products are rate limiting, or alternatively their 
gene products may have a positive regulatory function. For 
example borL shows greatest homology to auxin response proteins 
from plants. Auxins are hormones involved in the regulation of 
various cellular processes in plants, and borL may represent the 
first example of a related gene having regulatory function in a 
bacteria. As controls, an additional copy of borJ, borO and 
borA5, under the control of ermE* in pEM4, were introduced into 
S. parvulus TU4055, but did not have any effect upon the 
borrelidin titre. This was anticipated as none of the respective 
gene products are anticipated to be involved in starter unit 
biosynthesis. In addition, up-regulation of the putative 
* stuttering' PKS module (borA5) did not increase borrelidin 
titre, indicating that iterative use of this module occurs, 
rather than three independent copies being utilized. The lack of 
an effect on titre when JborO is up-regulated indicates that 
there is most probably no limitation of borrelidin due to 
toxicity in the producing organism and so indicates that there 
is further scope for titre improvement. 
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Formation of the nitrile moiety at C12 

Sequence analysis of the AT domain of the borrelidin PKS 
module 3 indicates that the substrate utilised for the third 
round of chain extension is methylmalonyl-CoA. Thus, the carbon 
atom of the nitrile moiety most probably arises from the methyl 
group of methylmalonyl-CoA. This was verified by stable isotope 
feeding experiments. Feeding [2, 3- 13 C 2 ] sodium propionate to S. 
parvulus TU113 gave borrelidin which displayed intact labelling 
of the carbons at C4-C24, C6-C25, C8-C26, C10-C27 and C12-C28, 
and with similar specific incorporations, as expected (figure 
2) . These data indicate that the conversion of the C12 -methyl 
group occurs either during chain assembly at, or after, the 
incorporation of the third extension unit, or that it occurs 
after polyketide chain assembly and release from the PKS, Based 
on functional assignments given to the borrelidin biosynthetic 
genes in conjunction with the gene disruption data described in 
table 10 , both borl and borJ are clearly implicated in formation 
of the nitrile moiety at C12, while others such as borK may also 
be. 

The cytochrome P450 hydroxylase Borl shares greatest 
similarity to TylHI, which catalyses the hydroxylation of an 
exocyclic methyl group of the tylosin macrolactone prior to 
addition of a deoxyhexose moiety (Fouces et al., 1999). Borl is 
therefore believed to catalyse oxidation of the C12-methyl group 
during borrelidin biosynthesis. In agreement with this the borl' 
mutant S. parvulus Tti4055/jborJ; :aac3 ( IV) fails to produce 
borrelidin but accumulates a new product 14 (figure 6) that is 
less polar than borrelidin. 14 is readily transformed to 
borrelidin when fed to the borK' mutant S. parvulus 
Ttt4055/JborK; :aac3(IV) which lacks the ability to synthesise the 
PKS starter unit but maintains the rest of the borrelidin 
biosynthetic genes intact. Fermentation of S. parvulus 
Tu4055/borJ; :aac3 (IV) followed by extraction and isolation 
provided -30 mg of 14 (example 18). Full structural analysis of 
14 identified it as 12-desnitrile-12-methylborrelidin (pre- 
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borrelidin) . This is consistent with the proposed role of Borl 
in borrelidin biosynthesis and provides a route to novel 
borrelidin analogues with a methyl group attached to C12 of the 
macrolactone ring. 

The putative PLP dependent aminotransferase BorJ is 
believed to catalyse the introduction of a nitrogen atom into 
borrelidin at the activated C28-position, probably via a C12- 
formyl moiety. In agreement with this the borJ~ mutant S. 
parvulus TU4055/JborJ": :aac3 (IV) does not produce borrelidin and 
accumulates a new compound that is more polar than borrelidin.. 
This new compound is not transformed to borrelidin when fed to 
mutant S. parvulus TU4055 / borE : :aac3 (IV) which indicates that it 
is probably a shunt metabolite rather than an intermediate in 
borrelidin biosynthesis. Fermentation of S. parvulus 
TiX^OSS/borJ: :aac3 (IV) allowed the isolation of 17 mg of the 
accumulated compound (example 19) . Detailed structural analysis 
identified the accumulant as 12-desnitrile-12-carboxyl 
borrelidin 2 . 

In addition to the compounds isolated from mutation of the 
borrelidin biosynthetic genes , a compound with the 
chromatographic and mass spectral characteristics consistent 
with 12-desnitrile-12-formyl borrelidin 15, has been observed in 
the fermentation supernatant of S. parvulus Tull3. The 
fermentation media and conditions used for these experiments 
differed from those we have described so far herein, but were 
designed to maximise the production of borrelidin. We propose 
that this altered medium, in combination with a drop in the 
dissolved oxygen concentration that was observed to occur during 
this specific fermentation, promoted the accumulation of 15. 

The above data lead us to propose one possible 
biosynthetic route to the nitrile moiety of borrelidin as 
presented in figure 6. The C12-methyl carbon of pre-borrelidin 
14 is first oxidised by Borl to introduce an allylic hydroxyl 
group at C28 (16) . We are presently unsure how this hydroxyl 
group is then converted to the formyl moiety attached to C12 
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(15) , but possibilities include spontaneous oxidation (or 
oxidation mediated by some background enzyme) or the action of a 
specific gene of the borrelidin biosynthetic gene cluster; 
candidate gene products are thus Borl itself, or alternatively, 
one of the oxidoreductase encoding genes such as borC or borK. 
The next step is anticipated to be Bor J-catalysed transamination 
of 15 in order to introduce a nitrogen atom at C28, in the form 
of an amine, through a pyridoxamine phosphate mediated process. 
The putative product amine 17 then undergoes oxidation, possibly 
spontaneously, but most probably by an enzymic activity such as 
that encoded by one of the oxidoreductase encoding genes borC or 
borK, or by a general oxidoreductase within the proteome . 

There are few pathways described in the literature for the 
biosynthesis of nitrile groups, these most commonly utilise 
inorganic cyanide (CIT) , which is known not to be the case for 
borrelidin (see above) . One route is described for the 
conversion of amines through to nitriles (Celenza, 2001; Kato et 
al., 2000a), which appears to be widespread in bacteria (Kato et 
al., 2000b). This pathway involves the oxidation of the amine 
to an aldoxime, which undergoes subsequent enzymatic 
dehydration. Potentially Borl and Bor J may catalyse the 
conversion of 12-methylborrelidin through to the putative 12- 
aminomethylborrelidin. This may then be used as a second 
substrate for Borl, both oxidising it to an aldoxime and 
dehydration to borrelidin A. 

In order to examine these proposed pathways in more 
detail, a number of biotransformation experiments were performed 
using pre-borrelidin 14 as substrate for investigating the 
action of JborX-K individually and in combination, using pEM4 as 
vector and S. albus J1074 (Chater & Wilde, 1980) as an 
expression strain. Expression of Jborl or bor J individually did 
not give borrelidin production on addition of 14. The added 14 
was only consumed during biotransformation with borl (and not in 
any of the control experiments); the 14 was identified as being 
converted to the shunt metabolite 2. However, co-expression of 
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borl & borJ did convert the added 14 to borrelidin, although not 
quantitatively- It may be that general proteome activities in S. 
albus are capable of oxidising the penultimate intermediate in 
the borrelidin biosynthetic pathway- Alternatively, the second 
pathway we describe may occur/ and Borl may be involved in both 
production of a 12-formyl group and the oxidation /dehydration 
of a subsequently formed aminomethyl group. 

In addition to the feeding of pre-borrelidin, 12- 
desnitrile-12-carboxyl borrelidin 2 was also fed to the three 
strains described above. No conversion of 2 to borrelidin was 
observed in any of these experiments , reinforcing the idea that 
2 is a shunt metabolite. 

Detailed investigation of genomic DNA from three 
borrelidin producing strains, S. rochei ATCC23956, S. parvulus 
TU113 and S. parvulus Tii4055, using numerous restriction digests 
and subsequent Southern Blot analysis, indicates that the 
borrelidin biosynthetic gene clusters of these three organisms 
are very closely conserved. It therefore appears that the 
borrelidin biosynthetic pathways of these strains are very 
similar. This assumption allows us to consider the data above, 
which are obtained from different strains, as applicable to a 
single biosynthetic pathway. 

It is clear to one skilled in the art that manipulation of 
the genes involved in formation of the C12-nitrile moiety of 
borrelidin, for example Jborl, or J, is a generally useful method 
for the production of novel borrelidin related molecules and 
borrelidin derivatives with altered functionality at C12 . In 
addition, the transfer of these genes to other organisms 
producing other natural or engineered polyketide products may 
allow the incorporation of nitrile moieties into such compounds . 

Other genes involved in borrelidin production 

In addition to the type-I terminal thioesterase domain of 
the borrelidin PKS, a discrete type-II thioesterase is located 
at the upstream boundary of the biosynthetic gene cluster and is 
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encoded by the gene borB. Such discrete type-II TE proteins are 
commonly found to be associated with type-I PKSs and are 
believed to play a role in the ^editing' of PKSs by the removal 
of short chain acyl groups that are formed by unwanted 
decarboxylation of extender units attached to KS domains 
(Heathcote et al., 2001). The disruption of such discrete type- 
II TEs in the picromycin (Xue et al., 1998) and tylosin (Butler 
et al., 1999) biosynthetic clusters leads to a significant 
reduction in titre of both macrolides. In accordance with these 
results, disruption of JborB gave a mutant that produced between 
43-75% of the parental wild type titre. 

The self-resistance of S. parvulus strains to borrelidin 
is most probably due to the product of JborO, which encodes a 
threonyl tRNA synthetase homologue. Threonyl-tRNA synthetase is 
the molecular target of borrelidin in sensitive strains (Paetz & 
Ness, 1973) . It is predicted that BorO is resistant to the 
action of borrelidin, and acts to produce threonyl-tRNAs in 
cells that make borrelidin, effectively complementing the normal 
threonyl-tRNA which are inhibited. To verify this hypothesis 
JborO was amplified by PCR and cloned in to the expression vector 
pEM4A, which puts JborO under the control of the strong 
constitutive promoter ermE* (example 20) . The resulting vector 
pJborOK was then transformed into the borrelidin-sensitive strain 
Streptomyces albus J1074 (Chater & Wilde, 1980) . Comparison of 
this strain with that containing only the expression vector 
pEM4A, using a soaked disk bioassay, clearly indicated that 
expression of jborO confers resistance to' borrelidin. 

EXAMPLES 

General methods 

Restriction enzymes, other molecular biology reagents, 
antibiotics and chemicals were purchased from standard 
commercial sources. Restriction endonuclease digestion and 
ligation followed standard methods (Sambrook et al., 1989). 
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Example 1: Fermentation of S. parvulus strains 

The following method is generally useful for culturing S. 

parvulus for the production of borrelidin and/or borrelidin 

analogues: 

5 A seed flask containing NYG medium (30 ml in a 250 ml 

Erlenmeyer flask) was inoculated from a working stock (0.5 ml) . 
NYG medium contains per litre of deionised water: beef extract 
(0.3 %) , Bacto peptone (0.5 %) , glucose (1 %) and yeast extract 
(0.5 %) . After 2 days shaking in a rotary incubator (2-inch 

10 throw; 30 °C; 250 rpm) the resulting cream culture was used to 
inoculate PYDG production medium (30 ml in a 250 ml Erlenmyer 
flask; 10 % innoculum) . PYDG medium contains p>*r litre of 
deionised water: peptonised milk nutrient (1.5 %) , yeast 
autolysate (0.15 %), dextrin (4.5 %) and glucose (0.5 %) 

15 adjusted to pH 7.0. After 5 days shaking on a rotary incubator 
(2-inch throw; 30°C; 250 rpm) the culture was harvested for 
analysis as described in example 4, or for isolation purposes as 
required. For quantitative analysis these experiments were 
performed in triplicate. 

20 

The following method is useful for the feeding of exogenous 
carboxylic acids to S. parvulus strains: 

The S. parvulus strain was grown as described above. After 
24 hours growth in PYDG production medium, the carboxylic acid 

25 of choice was added as a 50 \il single aliquot (0.6 M solution in 
70 % methanol after neutralization with 5 N NaOH) . The resulting 
culture was harvested after 5 days total fermentation and 
analysed as described in example 4. For quantitative studies 
these experiments were performed in triplicate, and the 

30 equivalent fed and unfed WT strains served as controls. 

Example 2: Cryopreservation of S. parvulus strains 
Working stocks 

* 

Working stocks of vegetative mycelia were prepared by 
35 mixing a 2 day old seed culture grown in NGY medium (0.5 ml) 
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with cryopreservative (0.5 ml). Cryopreservative consists of 20 
% glycerol and 10 % lactose in deionised water. 

Spore stocks 

Strains of S. parvulus were incubated on HA agar plates at 30°C. 
After 14 days the resulting spores from a single plate were 
harvested and suspended in of cryopreservative (1 ml) . HA agar 
contains per litre of deionised water: 0.4% yeast extract, 1% 
malt extract, 0.4 % dextrose and 1.5 % agar adjusted to pH 7.3. 

Example 3: Cloning of the borrelidin biosynthetic gene cluster 
and disruption of borA2 & JborA3 
Cosmid library generation 

A cosmid library was constructed in pWEl5 cosmid vector 
using the Gigapack® III Gold Packaging Extract kit according to 
the manufacturers handbook (Stratagene) . Chromosomal DNA was 
extracted from S. parvulus Tti4055 according to standard 
protocols (Kieser et al., 2000) and treated with Sau3AI prior to 
cloning into pWElS. A number of the resulting E. coli 
20 transformants (3300) were picked and transferred to 96 well 

microtitre plates containing Luria Broth (LB) medium (0.1 ml per 
well) with ampicillin (100 ug/ml) . The resulting clones were 
replica-plated to Luria agar (LA) plates containing ampicillin 
(100 ug/ml). After incubation overnight at 37°C colonies were 
25 transferred to nylon membrane filters for in situ colony 
hybridization analysis according to published protocols 
(Sambrook et al., 1989). 



Library screening 

30 The cosmid library was screened using a probe that was 

generated using the DIG DNA Labelling and detection kit (Roche) 
according to the manufacturers instructions. The probe used was 
a Bglll-BamHI fragment (1.7 kbp) obtained from the gene that 
encodes module 6 of the third subunit of the oleandomycin PKS 

35 from Streptomyces antibioticus (Swan et al., 1994). 
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Disruption of the borrelidin biosynthetic gene cluster 

Cosmids that gave a positive response when screened as 
described above were digested with BamRI and fragments of less 
5 than 3 kbp were subcloned into pOJ260 (Bierman et al., 1992). 
These were then used to transform protoplasts of S. parvulus 

♦ 

Tti4055 as described in example 5. The resulting trans formants 
were then assessed for the ability to produce borrelidin. Two 
clones were borrelidin non-producers; both were obtained from 
10 cosBor32A2 and contain sequence typical of a modular PKS. The 

remaining cosmids were then screened using probes obtained from 

» 

the two BajnHI fragments , which led to the identification of the 
overlapping cosmid cosBorl9B9 that contained the remainder of 
the borrelidin biosynthetic cluster. 

15 

Sequencing of cosBor32A2 and cosBor!9B9 

The cosmids cosBor32A2 and cosBorl9B9 were transformed 
into E. coli DH10B and the resulting clones grown at 37 °C in 
2xTY media (30 ml) containing ampicillin. After 15 hours the 

20 cells were harvested and Qiagen Tip 100 kits were used to 

prepare cosmid DNA. Approximately 5 ]ig of the cosmid DNA was 
digested with Sau3AI (1 U) . Samples were taken at 2, 4, 6, 8 & 
10 minute intervals after the enzyme was added and quenched into 
an equal volume of ice cold 0.5M EDTA. The samples were mixed 

25 and then analysed by gel electrophoresis , and those fragments 
between 1.5-2.0 kbp recovered from the gel. The fragments were 
cloned into linearised and dephosphorylated pHSG397 (Takeshita 
et al., 1987) , and transformed into E. coli DH10B. The resulting 
clones that contained insert were grown in 2xTY medium (2 ml) 

30 containing chloramphenicol (30 \xg/ml) and purified using Wizard 
kits (Promega) . 

DNA sequencing was carried out using an Applied Biosystems 
800 Molecular Biology CATALYST robot to perform the dideoxy 
terminator reactions, which were then loaded into an ABI Prism 

35 3700 automated sequencer (Applied Biosystems) . The raw sequence 
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data was processed using the Staden software package. Assembly 
and contig editing was performed using GAP (Genome Assembly 
Program) version 4.2 (Bonfield et al., 1995). The GCG package 
(Devereux et al., 1984) version 10.0 was used for sequence 
analysis . 



Example 4: Chemical analysis of S. parvulus strains 

The following method is useful for analysing fermentations 
(see example 1) for the production of natural borrelidins and of 
engineered borrelidin analogues: 

In a 2 ml Eppendorf tube, an aliquot of 5 day old 
fermentation broth (1 ml) was adjusted to pH~3 by the. addition 
of 90 % formic acid (ca. 20 pi) . Ethyl acetate (1 ml) was added 
to the sample and mixed vigorously for 10 min using a vortex 
tray. The mixture was separated by centrifugation in a microfuge 
and the upper phase removed to a clean 2 ml Eppendorf tube. The 

* 

ethyl acetate was removed by evaporation using a Speed-Vac. 
Residues were dissolved into methanol (250 ul) and clarified 
using a microfuge. Analysis was performed on an Agilent HP1100 
HPLC system as describe below: 
Injection volume: 5.0 ]il 

Column stationary phase: 150 x 4.6 mm column, base- 
deactivated reversed phase silica gel, 3 pm particle size 
( Hypers il Cie-BDS) . 

Mobile phase A: 10 % acetonitrile : 90 % water, containing 
10 mM ammonium acetate and 0-1 % TFA. 

Mobile phase B: 90 % acetonitrile : 10 % water, containing 
10 mM ammonium acetate and 0.1 % TFA. 

Mobile phase gradient: T=0 min, 25%B; T=15, 100%B; T=19, 
100%B; T=19.5, 25%B; T=25, 25%B. 
Flow rate: 1 ml/min. 

Detection: UV at 258 nm (DAD acquisition over 190-600 nm) ; 
MS detection by electrospray ionisation over m/ z range 
100-1000 amu, with +/-ve ion mode switching. 
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Example 5; Protoplast transformation protocol for S. parvnlus 
Tu4055 

A seed flask containing tryptone soy broth (TSB) medium 
(10 ml in a 100 ml Erlenmyer flask) was inoculated from a 
5 working stock (0.15 ml). After 3 days shaking on a rotary 
incubator (30°C, 250 rpm) , 5 ml of the culture was used to 
inoculate R5 medium (Kieser et al., 2000) (50 ml in a 250 ml 
Erlenmeyer flask) that was then shaken on a rotary incubator for 
24 hours (30°C, 250 rpm) . The PEG mediated transformation of 
10 protoplasts was then performed according to standard published 
protocols (Kieser et al., 2000). 

Example 6: Replacement of borAT4 with rapAT2 - production of 
ClO-desmethyl borrelidin 

15 The borrelidin PKS AT 4 domain is replaced with the AT 2 domain 

of the rapamycin polyketide synthase as follows : 

CosBor32A2 is digested with EcoRI and the 5429 bp band 
isolated. This is used as a template for PGR using the oligos 
CM410 (5' -AAAATGCATTCGGCCTGAACGGCCCCGCTGTCA-3 f ) (SEQ ID No. 45) 

20 and CM411 ( 5' -AAATGGCCAGCGAACACCAACACCACACCACCA-3' ) (SEQ ID 

No. 46). CM410 introduces an Nsil restriction site for cloning 
purposes and CM411 introduces an MscI site for use in the 
introduction of a heterologous AT. The -1.1 kbp product is 
cloned into pUC18 digested with Smal and dephosphorylated. The 

25 insert can ligate in two orientations and the reverse 

orientation is screened for by restriction enzyme analysis and 
the insert sequenced. One correct plasmid is designated pCJM462. 
Methylation deficient DNA (specifically dcm") of pCJM462 and 
pCJR2 6 (Rowe et al. 1998) is isolated by passaging the plasmids 

30 through E. coll ET12567. Each plasmid is then digested with MscI 
and Xbal and the -7.8 kbp fragment from pCJR26, containing the 
rapamycin AT 2 and sequences downstream in pCJR26, is ligated to 
the -3.8 kbp backbone generated by digestion of pCJM462. Plasmid 
pCJM463 is identified by restriction analysis. 
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CosBor32A2 is digested with EcoRl and BcoRV and the 2871 bp 
band isolated. This is used as a template for PCR using the 
oligos CM412 (5' -AAAGTCCTAGGCGGCGGCCGGCGGGTCGACCT-3' ) (SEQ ID 
No. 47) and CM413 (5' -TTTAGATCTCGCGACGTCGCACGCGCCGAACGTCA-3' ) 
(SEQ ID No. 48). CM412 introduces an Avrll restriction site that 
joins, in frame, the downstream borrelidin homology to the 
heterologous AT, and CM413 introduces a'Bglll site for cloning 
purposes. The -1.1 kbp product is cloned into pUC18 digested 
with Smal and dephosphorylated. The insert can ligate in two 
orientations and the reverse orientation is screened for by 
restriction enzyme analysis and the insert sequenced. One 
correct plasmid is designated pCJM464 i 

Plasmids pCJM463 and pCJM464 are digested with Avrll and Xbal 
and the -1.1 kbp fragment from pCJM4 64 is ligated into the -4.7 
kbp backbone of pCJM463 to give pCJM465, which is identified by 
restriction enzyme analysis. pCJM465 contains the hybrid 
rapamycin AT 2 with flanking regions of borrelidin sequence which 
provide homology for integration and secondary recombination. 

Plasmid pCJM465 is digested with Nsil and Bglll and the -3 
kbp fragment is cloned into pSLH80 previously digested with 
Nsil and BamHI to give pCJM466. Plasmid pCJM4 66 is then digested 
with Nsil and the apramycin cassette is incorporated on a PstI 
fragment from pEFBA (Lozano et al. 2000) to give the replacement 
vector pCJM467. pCJM467 is introduced into S. parvulus TU4055 by 
protoplast transformation as described in example 5. Colonies 
resistant to apramycin (25 ug/ml) are initially identified, and 
then passaged several times through MA media without antibiotic 
selection in order to promote the second recombination 
(Fernandez et al. 1998). Several apramycin-sensitive colonies 
are isolated and analysed by PCR and Southern blot. The new 
mutant is named S. parvulus Tu4055/467. 

S. parvulus TU4055/467 is analysed as described in example 1 
and shown to produce a mixture of compounds with the correct OV 
spectrum. One of the new major components that is more polar 
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than borrelidin has the correct retention time for 10-desmethyl 
borrelidin 3. LCMS analysis indicates an m/z ratio for a 
compound that is 14 mass units lower than borrelidin as 
expected, and with an appropriate mass fragmentation pattern. 
Borrelidin itself is also produced, but at levels lower than the 
WT organism. 

Example 7 : Mutation of the methylmalonyl-CoA selective motif of 
borAT4 to generate 10-desmethyl borrelidin 

Site directed mutagenesis of acyl transferase domains may 
also be used to alter the specificity of an AT. In this example 
the specificity of borAT4 is directed from methyl-malonyl-CoA 
towards malonyl-CoA. An amino acid motif has been identified 
(Reeves et ai., 2001; WO 02/14482) which directs the specificity 
of an AT. The motif YASH, as observed in borAT4, is found in 
methylmalonyl-CoA specific ATs and in this example it is altered 
to HAFH which is found in malonyl-CoA specific ATs. 

CosBor32A2 is digested with Ncol and the 5167 bp band 
isolated. This is used as a template for PCR using the primers 
CM414 (5' -AAACTGCAGAGTCGAACATCGGTCACACGCAGGC-3' ) (SEQ ID No. 49) 
and CM415 (5' -AAAATGCATGATCCACATCGATACGACGCGCCCGA-3 r ) (SEQ ID 
No. 50). CM414 introduces a PstI restriction site for cloning 
purposes, and CM415 is a mutagenic primer covering the motif 
encoding region of the AT which will effect the amino acid 
changes and contains an Nsil site for cloning purposes. The -1.1 
kbp fragment is cloned into pUC18 digested with Smal and 
dephosphorylated. The insert can ligate in either orientation 
and the forward orientation is screened for by restriction 
enzyme analysis and the insert sequenced. One correct plasmid is 
designated pCJM468. 

A second PCR reaction is performed using the 5167 bp Ncol 
fragment of CosBor32A2 and the primers CM416 (5'- 
TAAATGCATTCCATTCGGTGCAGGTGGAGTTGATCC-3 ' ) (SEQ ID No. 51) and 
CM4 17 ( 5 ' -ATAGGATCCCCTCCGGGTGCTCCAGACCGGCCACCC-3 ' ) ( SEQ ID 
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No. 52). CM416 introduces an Nsil restriction site and is also a 
mutagenic primer covering the motif encoding, region of the AT, 
and CM417 introduces a BamHI site for cloning purposes. The -1.1 
kbp fragment is cloned into pUC18 previously digested with Smal 
and dephosphorylated. The insert can ligate in two orientations 
and the forward orientation is screened for by restriction 
enzyme analysis and the insert sequenced. One correct plasmid is 

designated pCJM469. 

Plasmids pCJM468 and pCJM469 are digested with Nsil and Xbal 
) and the -1.1 kbp fragment from pCJM468 is ligated into the -3.8 
kbp backbone of P CJM469 to give pCJM470, which is identified by 
restriction enzyme analysis. P CJM470 contains the mutated motif 
of borAT4 with -1.1 kbp of homologous DNA on either side which 
provide homology for integration and secondary recombination. 
15 Plasmid pCJM470 is digested with PstI and BamHI and the -2.2 

kbp fragment is cloned into pSL1180 (Amersham Biosciences) 
previously digested with PstI and BamHI to give P CJM471. Plasmid 
' pCJM471 is then digested with PstI and the apramycin cassette is 
incorporated on a PstI fragment from pEFBA (Lozano et al., 2000) 
20 to provide the replacement vector pCJM472. 

The replacement vector pCJM472 is introduced into S. parvulus 
Tu4055 by protoplast transformation as described in example 5. 
Colonies resistant to apramycin are initially identified, and 
then passaged several times through MA media without antibiotic 
25 selection in order to promote the second recombination 

(Fernandez et al., 1998). Several apramycin-sensitive colonies 
are isolated and analysed by PCR and Southern blot, and one is 
selected that contains the new AT 4 sequence containing the 
mutated motif and the Nsil site. The new mutant is named S. 

30 parvulus TU4055/472. 

S. parvulus Tu4055/472 is grown and analysed as described in 
example 1 and shown to produce a mixture compounds with the 
correct UV for borrelidin. One of the new major components that 
is more polar than borrelidin has the correct retention time for 

35 authentic 3. LCMS analysis indicates an m/z ratio for a compound 
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that is 14 mass units lower than borrelidin as expected, and 
with an appropriate mass fragmentation pattern, Borrelidin 
itself is also produced, but at levels lower than the WT 
organism, 

5 

Example 8: Introduction of the borrelidin loading module into 
the erythromycin PKS 

The borrelidin loading module was amplified for each of 
the four putative start codons . The PCR template was a 337 6 bp 

10 BawEl fragment of cosBor32A2 covering the region from 

nucleotides 15858 to 19234 of SEQ ID No . 1 . The reverse primer 
CM368 (5' -TTTCCTGCAGGCCATCCCCACGATCGCGATCGGCT— 3 ' ) (SEQ ID No: 53) 
introduces a Sjb^I site at the sequence corresponding to the 
start of KSl of borA2 (conserved MACRL motif) and is used with 

15 each of the forward primers CM369 (5 f - 

TTTCATATGACAGGCAGTGCTGTTTCGGCCCCATT-3' ) (SEQ ID No. 54), CM370 
(5 f -TTTCATATGGCGGATGCCGTACGTGCCGCCGGCGCT— 3 f ) (SEQ ID No. 55), 
CM371 (5 f -TTTCATATGCCCCAGGCGATCGTCCGCACCAC-3 ' ) (SEQ ID No. 56) 
and CM372 (5' -TTTCATATGGTCTCGGCCCCCCACACAAGAGCCCTCCGGGC-3' ) (SEQ 

20 ID No:57). The four PCR products (of 2834, 2720, 2411 and 2117 

bp respectively) were cloned into pUC18 that had previously been 
digested with Smal and dephosphorylated. The resulting plasmids 
were designated pCJM370, which contains the largest insert, 
PCJM371, pCJM372 and pCJM373, which contains the smallest 

25 insert. 

The four borrelidin loading module fragments were 
introduced into the vector pKSlW, which contains a PstI site at 
the start of eryKSl of DEBS1-TE in the conserved MACRL motif 
(Rowe et al., 2001); PstI gives the same overhang as Sbfl. pKSlW 
30 is a pT7-based plasmid containing DEBS1-TE on an Ndel/Xbal 
fragment, with unique sites flanking the loading module, a 
unique PstI site at nucleotide position 1698 of the DEBS1-TE 
encoding gene and a unique Wdel site at the start codon. The 
borrelidin loading module fragments were excised as follows: 
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PCJM370 was digested with Ndel and SJb.fi, pCJM371 and pCJM373 
were digested with Ndel and Pstl, and pCJM372 was digested with 
Ndel, Pstl and Dral. Each loading module containing fragment was 
cloned into pKSlW previously digested with Ndel and Pstl. The 
5 resulting, plasmids were designated pCJM384, which contains the 
largest insert, then pCJM386, pCJM388 . and pCJM390, which 
contains the smallest insert. 

The hybrid PKS fragments were transferred into pCJR24, 
which is a suitable vector for transformation of S. erythraea WT 
10 and S. erythraea DM, and for expression of the resulting hybrid 
PKS (WO 98/01546) . Each loading module construct was excised 
along with a 2346 bp fragment of DNA from DEBSl in order to 
allow integration into the chromosome. In order to achieve this, 
pCJR24 is digested with Xbal and end-filled using Klenow 
15 fragment of DNA polymerase I. This is then digested with Ndel to 
give the backbone fragment. Into this, the four hybrid PKS 
fragments containing the borrelidin loading modules plus the 
region of DEBSl sequence for integration are cloned as 
Ndel/EcoRV fragments from pCJM384, pCJM386, P CJM388 and pCJM390 
20 to give pCJM400, pCJM401, pCJM402 and pCJM403 respectively. 

Plasmids pCJM400, pCJM401, pCJM402 and pCJM403 were 
introduced into S. erythraea by transformation of S. erythraea 
DM protoplasts as described elsewhere (Gaisser et al., 2000). 
The resulting mutants were analysed by PCR and Southern blot to 
25 confirm the presence of the plasmid on the chromosome and to 
establish that correct integration had occurred. A number of 
mutants that appeared correct by these methods were grown, 
extracted and analysed according to standard methods for 
polyketide production from S. erythraea strains (Wilkinson et 
30 al., 2000). When compared to control strains using LCMS methods, 
the extracts from several of these mutants contained new 
compounds at reasonable levels. Analysis of their MS spectra 
showed the presence of a compound with m/z = 485.3 ( [M-H] ~, 6) 
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in negative ion mode. This is in agreement with the expected 
product compound (M = 48 6.3) . 

Example 9; Fusion of PKS modules 4 and 5 (S. parvulus 
5 Ttt4055/jborA4-A5) 

To examine the iterative action of module 5, the two 
separate proteins encoding modules 4 and 5 were fused together 
through manipulation at the genetic level. The fusion was 
performed by a gene replacement in which the last -1 kbp of 

10 borA4 and the first ~1 kbp of borAS, were fused by converting 
the overlapping stop and start codons respectively into an 
arginine residue, introducing a new Xbal site and converting the 
two separate orfs into one. 

In the first step of the mutagenesis, two separate PCR 

15 amplifications were performed. The template DNA was cosBorl9B9, 
and used the primers B1819A (5 1 -GTCATGCATGCGGCGGGCTC-3 ' ) (SEQ ID 
No. 58) and B1819B (5 1 -GGTCTAGAACGGCCGAACTT-3 1 ) (SEQ ID No-59). 
The 1063 bp product was purified, digested Nsil-Xbal and cloned 
into pSL1180 (Amersham Biosciences) digested similarly to give 

20 plasmid pSL18-19AB. The second PCR reaction amplified the borA5 
fragment and used the primers B1819C (5 1 -GTTCTAGAACCTCGGTCGGC- 
3') . (SEQ ID No. 60) and B1819D (5 1 -CTGGATCCCACGCTGCTGCG-3 1 ) (SEQ 
ID No. 61). The 1033 bp product was purified, digested Xbal-BamHI 
and cloned into pSL18-19AB that had been digested similarly, to 

25 give plasmid pSL18-ABCD. Finally, the apramycin cassette from 
pEFBA (Lozano et al., 2000) was excised as a PstI fragment and 
cloned into pSL18-19ABCD digested with NslI to give the 
replacement vector pSL18-19Apra. 

The replacement vector pSL18-19Apra was introduced into S. 

30 parvulus TU4055 by protoplast transformation as described in 
example 5. Colonies resistant to apramycin (25 >ig/ml) were 
initially selected, and then passaged several times through MA 
media without selection. Several apramycin-sensitive colonies 
were obtained, two of which produced borrelidin while the others 

35 did not. 
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Chromosomal DNA was extracted from all of the apramycin 
sensitive colonies and checked initially by PCR using the 
primers BLDA (5 ' -GGAGACTTACGGGGGATGC-3 1 ) (SEQ ID No. 62) and BLDB 
(5--CTCCAGCAGCGACCAGAAC-3') (SEQ ID No. 63) that are selective 
for the loading module (JborAl) . A 2.9 kbp fragment was observed 
for the control and the two borrelidin-producing mutants, but 
not for the non-producing strains. This result is symptomatic 
and characteristic of non-specific deletions in the chromosome. 

The two borrelidin-producing colonies were analysed 
further by PCR using the primers B19A (5« -CCCATGCATCACCGACATAC- 
3') (SEQ ID No. 64) and B19B (5 1 -GCGATATCCCGAAGAACGCG-3 ' ) (SEQ ID 
No. 65) in order to check the fusion site. The method was as 
described above. Both the colonies and the controls gave a PCR 
product of 1010 bp, but upon digestion with Xbal only those that 
carried the fusion-producing mutation gave digestion to 600 and 
400. bp fragments.. Only one of the borrelidin-producing colonies 
harboured the fusion, while the other had reverted to wild type. 
Final confirmation came from Southern analysis using a BamHI- 
Xhol internal fragment from borAS as probe over chromosomal DNA 
digested with Xbal and Bell. The control and wild type revertant 
colony showed a fragment of 11.5 kbp as expected, while the 
fusion mutant showed a . fragment of 7.8 kbp as expected. This new 
mutant was named 5. parvulus Tii4055/JborA4-A5. S. parvulus 
Tii4 055 /horAA -A5 was shown to produce borrelidin at 26±5% of the 
WT titre, following the protocol described in example 1. 

Example 10: Fusion of PKS modules 5 and 6 (S. parvulus 

Tu4 0 5 5 /borA5 -A6) 

This experiments was performed for the same reason as, and 
in an analogous manner to, that of example 6 above. The fusion 
of these orfs introduces an additional leucine residue into the 
new protein at the fusion point, in addition to a new Spel site 
at the genetic level. In the first step of the process two PCR 
fragments were generated using cosBorl9B9 as template. The first 
PCR reaction amplified the JborA5 region and used the primers 
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B1920A (S'-GCCAAGCTTCCTCGACGCGC-S' ) (SEQ ID No. 66) and B1920B 
(5 1 -CACTAGTGCCTCACCCAGTT-3 1 ) (SEQ ID No. 67), The 804 bp product 
was purified and digested with Hindlll-Spel . The second PCR 
reaction amplified the borAS region and used the primers B1920C 
(5 1 -CACTAGTGACGGCCGAAGCG-3 ' ) (SEQ ID No. 68) and B1920D (5'- 
TCGGATCCGTCAGACCGTTC-3 1 ) (SEQ ID No. 69). The 960 bp product was 
purified and digested with Spel-BamHI. The two purified and 
digested gene products were then cloned together into pOJ260 
that had been digested with Hindlll-BajnHI to give the 
replacement vector pOJF19-20. pOJF19-20 was introduced into S. 
parvulus Tii4055 by protoplast transformation to give apramycin 
resistant colonies. One such colony was passaged several times 
through MA media without selection in order to promote double 
recombination. Two apramycin sensitive colonies were obtained, 
and chromosomal DNA from these was examined by Southern 
hybridisation to check for the presence of a 3.2 kbp BamHI . 
fragment (to control for unwanted deletions in the loading 
module) and a 3.4 kbp Spel-BajnHI fragment to verify correct 
introduction of the borA5-A6 fusion (5.8 kbp BamHI fragment in 
the WT) . One of the apramycin colonies carried the correct 
mutation without deletion and was named S. parvulus 
T114 055/ borAS ~A6. S. parvulus TU4055 /borA5-A6 was shown to 
produce borrelidin at 25±4% of the WT titre, following the 
protocol as described in example 1. 

Example 11: Replacement of the erythromycin PKS module 4 with 
module 5 of the borrelidin PKS - production of ring expanded 
macrolides 

Example 11 describes the replacement of erythromycin module 4 
with borrelidin module 5. Borrelidin module 5 is believed to be 
responsible for three rounds of condensation of methylmalonyl- 
CoA, in an iterative fashion, within the borrelidin PKS. 
Previously, erythromycin module 4 has been shown to occasionally 
act in an iterative fashion *mis' -incorporating a second 
methylmalonyl-CoA to make very small amounts of a 16-membered 
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macrolide from the erythromycin PKS. A strain in which the 
erythromycin module 4 is replaced by borrelidin module 5 is 
engineered by a replacement strategy as follows, and is based on 
a derivative process as described for module insertion into the 
5 erythromycin PKS (Rowe et al., 2001): 

Initially a series of plasmids are made in order to generate 
a plasmid in which the borrelidin module 5 is flanked by 
appropriate regions of homology from the erythromycin PKS. In 
order to facilitate this, the Sbfl site is first removed from 
10 the polylinker of pUC18 by digestion with PstI, end-polishing 
with T4 polymerase and religation. The new plasmid, pCJM409 is 
identified by restriction enzyme digestion. 

Borrelidin module 5 is isolated on an Sbfl fragment by 
ligating together 4 PGR fragments. PCRA is generated by 
15 amplification of -1.4 kb of the beginning of borrelidin module 5 
using the 6062 bp Xcml fragment of cosBorl9B9 as the template 
and primers CM384 (5' -AACCTGCAGGTACCCCGGTGGGGTGCGGTCGCCCGA-3' ) 
(SEQ ID No. 70) and CM385 (5' -CGCCGCACGCGTCGAAGCCAACGA— 3' ) (SEQ 
ID No. 71). CM384 introduces an SbfT site in the conserved amino 
20 acid sequence MxCR at the beginning of borrelidin module 5. 

CM385 incorporates a naturally occuring MIuI site that is used 
in the cloning strategy. PCRA is treated with T4 polynucleotide 
kinase (T4 PNK, NEB) and cloned into pCJM409 previously digested 
with Smal and dephosphorylated with Shrimp Alkaline Phosphatase 
25 (SAP, Roche) . Inserts cloned in the forward direction are 
screened for by restriction enzyme digestion, and for one 
correct clone the insert is verified by sequencing. This plasmid 

is designated pCJM410. 

PCRB is generated by amplification of the adjacent -1.4 kb 

30 of borrelidin module 5 using the 6062 bp Xcml fragment of 
cosBorl9B9 as the template and primers CM386 (5'- 
TGTGGGCTGGTCGTTGGCTTCGAC— 3' ) (SEQ ID No. 72) and CM387 (5'- 
GGTGCCTGCAGCGTGAGTTCCTCGACGGATCCGA-3 ' ) (SEQ ID No. 73) . CM386 
binds upstream of the same Mlul site as CM385 contains, which is 

35 used in the cloning strategy. CM387 is used to remove the Sbfl 



76 



site within the borrelidin PKS module 5 whilst leaving the 
overlapping PstI site for cloning. PCRB is treated with T4 PNK 
and cloned into pCJM409 previously digested with Sinai and 
dephosphorylated with SAP. Inserts cloned in the forward 
direction are screened for by restriction enzyme digestion, and 
for one correct clone the insert is verified by sequencing. This 
plasmid is designated pCJM411. 

PCRC is generated by amplification of the downstream 
adjacent -1.5 kb of borrelidin module 5 using the 6062 bp ' Xcml 
fragment of cosBor!9B9 as the template and oligonucleotides 
CM388 (5' -GAGGAACTCACCCTGCAGGCACCGCT-3 ' ) (SEQ ID No. 74) and 
CM395 ( 5 9 -CGAACGTCCAGCCCTCGGGCATGCGT-3 ' ) (SEQ ID No . 75) . CM3C8 
binds at the same Sixfl site as CM387, but is not mutagenic and 
retains the Sbfl site. CM395 incorporates an SphI site for 
cloning purposes. PCRC is treated with T4 PNK and cloned into 
pCJM409 previously digested with Smal and dephosphorylated with 
SAP. Inserts cloned in the forward direction are screened for by 
restriction enzyme digestion and for one correct clone the 
insert is verified by sequencing. This plasmid is designated 
pCJM412. 

PCRD is generated by amplification of the downstream 
adjacent -2.1 kb of borrelidin module 5 using the 7211 bp BJbvCI 
fragment of cosBorl9B9 as the template and oligonucleotides 
CM396 (5' -TGGCACGCATGCCCGAGGGCTGGACGTT-3' ) (SEQ ID No. 76) and 
CM 3 97 ( 5 ' -TTTCCTGCAGGCCATGCCGACGATCGCGACAGGCT - 3 ' ) ( SEQ ID 
No. 77) 1 CM396 contains the SphI site for cloning purposes, and 
CM397 introduces an Sbfl site in the conserved amino acid 
sequence MxCR at the end of borrelidin module 5. PCRD is treated 
with T4 PNK and cloned into pCJM409 previously digested with 
Smal and dephosphorylated with SAP. Inserts cloned in the 
forward direction are screened for by restriction enzyme 
digestion, and for one correct clone the insert is verified by 
sequencing, this plasmid is designated pCJM413. 

The four PCR products (PCRA-D) are used to construct the 
borrelidin module 5 on an Sbfl fragment as follows : 
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pCJM412 is digested with SphI and the -1.5 kb fragment 
isolated is cloned into pCJM413 previously digested with SphI 
and dephosphorylated with SAP. This gives plasmid pCJM414, which 
is identified by restriction enzyme digestion. 
5 pCJM414 is digested with Sbfl and the -3.6 kb fragment 

isolated is cloned into P CJM411 previously digested with PstI 
and dephosphorylated with SAP. This gives pCJM415 which is 
identified by restriction enzyme digestion. 

pCJM410 is digested with AttuI and ffindllll and the -1.4 kb 
10 fragment isolated is cloned into pCJM415 previously digested 

with Mlul and JTindllll. This gives pCJM416, which is identified 
by restriction enzyme digestion. pCJM416 is a pUC18-based 
plasmid containing the borrelidin module 5 as an Sbfl fragment. 
In order to introduce the Borrelidin module 5 into the 
15 erythromycin PKS by a replacement strategy, flanking regions of 
homology from the erythromycin PKS are incorporated for 
recombination as follows: 

PCRE is generated by amplification of -3.3 kb of the 
erythromycin PKS directly upstream of the module 4 KS using the 
20 6428 bp XmnI fragment of pIB023 as the template and primers 
CM398 (5' -AAACATATGGTCCTGGCGCTGCGCAACGGGGAACTG-3' ) (SEQ ID 
No. 78) and CM399 (5' -TTTCCTGCAGGCGATGCCGACGATGGCGATGGGCT— 3' ) 
(SEQ ID No. 79). CM398 contains an Wdel site for cloning purposes 
and CM399 introduces an Sbfl site in the conserved amino acid 
25 sequence M/IxCR at the beginning of erythromycin module 4. PCRE 
is treated with T4 PNK and cloned into pCJM409 previously 
digested with Smal and dephosphorylated with SAP. Inserts cloned 
in the forward direction are screened for by restriction enzyme 
digestion, and for one correct clone the insert is verified by 
30 . sequencing, this plasmid is designated pCJM417. 

PCRF is generated by amplification of -3.4 kb of the 
erythromycin PKS directly downstream of the module 5 KS using 
the 7875 bp Xmnl/Nhel fragment of pIB023 as the template and 
primers CM400 ( 5 ' -AAACCTGCAGGTTCCCCGGCGACGTGGACTCGCCGGAGTCGTT- 

35 3') (SEQ ID No. 80) and CM401 (5'- 
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TTTTCTAGAGCGACGTCGCAGGCGGCGATGGTCACGCCCGT-3' ) (SEQ ID No. 81). 
CM400 introduces an Sbfl site in the conserved amino acid 
sequence M/IxCR at the beginning of erythromycin module 4, and 
primer CM401contains an Xbal site for cloning purposes. PCRF is 
5 treated with T4 PNK and cloned into pCJM409 previously digested 
with Smal and dephosphorylated with SAP. Inserts cloned in the 
forward direction are screened for by restriction enzyme 
digestion, and for one correct clone the insert is verified by 
sequencing. This plasmid is designated pCJM418. 

10 pCJM417 is digested with Ndel and Sbfl and the -3.3 kb 

fragment is cloned into pCJM418 digested with Ndel and Sbfl 
(-5.8 kbp) to give pCJM419 which is identified by its 
restriction digest pattern. pCJM419 contains a unique Sbfl site 
which can be used to accept any complete module with Sbfl (or 

15 Pstl) flanking sites appropriate to place, in-frame, the in- 
coming module exactly into the conserved region of the KS 
domain . 

The borrelidih module 5 with flanking Sbfl sites is cloned 
from pCJM416 as an SJbfl fragment into the unique Sjb.fi site of 

20 pCJM419 (which has been dephosphorylated with SAP) to give 

pCJM420, which is identified by restriction enzyme analysis to 
confirm the presence and correct orientation of the insert. 
pCJM420 thus contains borrelidin module 5 with flanking regions 
of homology to introduce it in-frame between modules 3 and 5 of 

25 the erythromycin PKS. The complete insert is removed as an 

Ndel/Xbal fragment from pCJM420 and cloned into pCJM24 digested 
with Ndel and Xbal to give the final plasmid pCJM421. pCJR24, 
and consequently pCJM421, contain an appropriate resistance 
marker for selection of S. erythraea transf ormants . 

30 Plasmid pCJM421 is used to transform S. erythraea strains 

NRRL2338 (wild type), and S. erythraea DM (eryCIir, eryBV) 
protoplasts (Yamamoto et al., 1986; Rowe et al., 1998). 
Integrants are selected for resistance to thiostrepton (50mg/L) 
and a number of integrants (typically 5-8) are analysed further 

35 by Southern blot to confirm that the strains are correct and to 
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identify the site of integration. Two correct integrants in each 
case are sub-cultured in TSB liquid media without antibiotic 
selection in order to promote the second recombination. Several 
thiostrepton-sensitive colonies are isolated and analysed by PCR 
and Southern blot, and in each case one selected that contains 
the new module correctly inserted. This leads to strains S. 
erythraea WT/421 and S. erythraea DM/421. 

Strain S. erythraea DM/421 is cultured under conditions 
appropriate for the production of erythronolides (Wilkinson et 
al., 2000). Analysis of fermentation broth extracts using LCMS 
methods indicates the presence of two new significant peaks when 
compared to the control strain, and which are less polar than 
erythronolide B. These have an m/z of 435.5 (MNa + ) and 477.5 
(MNa + ) respectively, which is consistent with the production of 
new ring expanded erythronolide B analogues. The compound with 
m/z = 435.5 (7) is consistent with the presence of the 16- 
membered ring-expanded erythronolide B related macrolide 
reported previously as a minor component of S. erythraea WT 
fermentations (Wilkinson et al., 2000); the compound with m/z = 
477.5 (8) is consistent with the presence of an 18-membered, 
doubly ring-expanded erythronolide B related macrolide (see 
figure 8) . It is clear to one skilled in the art that such new 
products can be converted to antibacterial molecules by 
biotransformation with an appropriate organism, or through the 
fermentation of the strain S. erythraea WT/421. It is further 
clear to one skilled in the art that the inclusion of such a 
module into other positions of the erythromycin PKS or into 
other PKSs may allow the production of novel, ring expanded 
polyketides in a similar manner. 

An alternative strategy for generating this hybrid PKS is 
to incorporate the borrelidin module 5 in place of erythromycin 
module 4 within a large plasmid that contains the entire hybrid 
PKS, followed by transformation of an eryA' S. erythraea strain. 
Such an appropriate existing eryA" is S. erythraea JC2 (Rowe et 
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al., 1998) and the plasmid containing the eryA genes under the 
actl promoter, pIB023 that also contains a thiostrepton 
resistance gene and the act JJ-0RF4 activator. This strategy is 
accomplished as follows: 

pIB023 is digested with Ndel and Bsml and the 13.4 kbp 
fragment is cloned into pCJM419 digested with Wdel and Bsml to 
give plasmid pCJM425. pIB023 is digested with BjbvCI and Xbal and 
the approx. 6 kbp fragment is cloned into pCJM425 digested with 
BJbvCI and Xbal to give plasmid pCJM426. The Ndel/Xbal fragment 
from pCJM426 is cloned into pCJM395 digested with Ndel and XJbal. 
pCJM395 is a plasmid made by digesting pCJR24 with SJbfl, end- 
polishing with T4 polymerase and religating, to give a version 
of pCJR24 that does not cut with Sbfl. The resulting plasmid, 
pCJM427 f contains an engineered version of the erythromycin PKS 
in which module 4 is removed. This backbone is then ready to 
accept any complete module with appropriate flanking sites (Sbfl 
or PstI) to generate a hybrid PKS. Introduction of the single 
borrelidin module 5 is accomplished by digesting pCJM427 with 
Sbfl, dephosphoralating the backbone with SAP, and ligating in 
the Sbfl fragment from pCJM416, to give pCJM430. 

Plasmid pCJM430 is used to transform S. erythraea JC2. 
Integrants are selected for resistance to thiostrepton (50mg/L) 
and a number of integrants (typically 5-8) are analysed further 
by Southern blot to confirm that the strains are correct and to 
identify the site of integration. The resulting correct strain 
S. erythraea JC2/430 is cultured under conditions appropriate 
for the production of erythromycins (Wilkinson et al., 2000) and 
analysed for the production of novel compounds 7 & 8. 

Example 12: Disruption of borE (S. parvulus 
Tu4055/JborE: :aac3 (IV) ) 

In order to disrupt borE, an internal 761 bp fragment of 
the gene was amplified by PCR using primers B25A (5'- 
TTCTGCAGCCGCGGCCTTCG-3 1 ) (SEQ ID No. 82) and B25B (5»- 
AGAATTCGCCGGCGCCGCTG -3 1 ) (SEQ ID No. 83) using cosBor32A2 as 
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template. The product was purified, digested Pstl-EcoRl and 
cloned into pOJ260erraE* digested similarly, to provide pOJEdl. 
This approach was used in order to avoid possible polar effects • 
The vector pOJEdl was introduced into S. parvulus Ttt4055 by 
protoplast transformation as described in example 5 f and 
colonies were selected for apramycin resistance on R5 and then 
on MA agar. The disruption was verified by Southern 
hybridisation and the new mutant was named S. parvulus 
TU4055/Jbor£; :aac3.(IV) . Strain S. parvulus Tu4055/ borE : : aac3 (IV) 
was grown, extracted and analysed as described in example 1. No 
borrelidin production was observed whereas a wild type control 
produced borrelidin as expected. 

To verify that no polar effects were introduced a full- 
length copy of borE under, the control of the ermE* promoter was 
introduced in trans to the disrupted mutant. Full-length borE 
was amplified by PCR using the primers B7T1 (5'- 
GGCTGCAGACGCGGCTGAAG-3 ' ) (SEQ ID No. 84) and B7T2 (5'- 
CCGGATCCCAGAGCCACGTC-3 , ) (SEQ ID No. 85) using cosBor32A2 as 
template. The 1216 bp product was purified, digested with Pstl- 
BamHI and cloned into Pstl-XJbal digested pIJ2925 (Janssen & 
Bibb, 1993), along with a BamHI-Spel digested fragment from 
pLHyg containing the hygromycin resistance cassette, to generate 
pIJEH. A 2.8 kbp BamHI fragment was excised from pIJEH and 
cloned into pEM4 (Quiros et al., 1998) digested similarly to 
give pborEH (in which the borE gene was cloned in the correct 
orientation for gene expression) . pborEH and the control plasmid 
pEM4 were introduced into S. parvulus Tu4055/Jbor£: :aac (3) IV by 
protoplast transformation as described in example 5. The 
resulting strain S. parvulus Tii4Q55/borE: :aac (3) JV/pborEH was 
analysed as described in example 1 and shown to produce 
borrelidin at a titre similar to a WT control; the control 
strain S. parvulus TU4055/ borE: :aac (3) IV/pEM4 did not produce 

borrelidin. 

Chemical complementation of S. parvulus 
TH4055 / borE : :aac3 (IV) with trans-1, 2-dicyclopentane dicarboxylic 



82 



acid, following the protocol described in example 1, 

* * 

demonstrated that the strain thus grown was capable of 
borrelidin production at 122±23% of the WT parent control. Thus, 
borE is required for biosynthesis of trans-cyclopentane-1, 2- 
5 dicarboxylic acid. 

Example 13: Disruption of borL (5. parvulus 
T\14055/JborJ,: :aac3 (IV) ) 

In order to disrupt borL a 3.95 kbp Bglll fragment of 

10 cosBorl9B9, which contained the full-length borL, was sub-cloned 
into pSL1180 digested similarly. The resulting clones were 
analysed by restriction digest and one that displayed the 
correct orientation was chosen to provide pSL395. Digestion of 
pSL395with Miel and Spel, and subsequent re-ligation to 

15 eliminate a fragment of borM that included a Bglll site, gave 
pSL. The apramycin resistance cassette was excised with iKpnl 
from pEFBA (Lozano et al., 2000) and cloned into pSL that had 
been digested with KpriL, to give pSLLA. pSLLA was digested with 
Bglll and then subjected to Klenow treatment following the 

20 manufacturers instructions (Roche) ; an EcoRV fragment isolated 
from pLHyg containing the hygromycin resistance cassette was 
then cloned into this prepared vector to give pSLLrl. 

The replacement vector pSLLrl was introduced into S. 
parvulus Tti4055 by protoplast transformation. Colonies resistant 

25 to apramycin were selected, and then passaged several times 

through MA media without selection. The replacement was verified 
by Southern hybridisation. The new mutant was named S. parvulus 
TU4055/jborIr; :aac3 (IV) . 

Strain S. parvulus Tii4055/Jbori: :aac3 (IV) was grown, 

30 extracted and analysed as described in example 1. No borrelidin 
production was observed whereas a wild type control produced 
borrelidin as expected. Chemical complementation of 5. pazrvulus 
1014055 /borL: :aac (IV) using the natural starter acid as described 
in example 1 showed that the strain thus grown was capable of 

35 borrelidin production at 408±70 % of the WT parent control 
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titre. As a final control the strain was tested by in trans 
complementation with a full-length copy of JborL under the 
control of ermE* (see example 16) (in a manner as described in 
example 12) which re-established an approximately WT titre of 
borrelidin. 

Example 14: Disruption of JborJ (S. parvulus 
Ttt4055/borJ: :aac3 (IV) ) 

The gene JborJ and surrounding DNA was amplified from 
cosBor!9B9 using the PCR primers BP4501 (5 1 - 
CGTATGCATGGCGCCATGGA-3 1 ) (SEQ ID No, 86) and BP4502 (5 f - 
AGCCAATTGGTGCACTCCAG- 3 1 ) (SEQ ID No. 87). The 2.32 kbp product 
was purified, digested with Nsil-Mfel and cloned into pSLH80 
digested Nsil-EcoRl, to give plasmid pSLI. The apramycin 
resistance cassette was excised from pEFBA as an EcoRl fragment 
and cloned into pSLI digested with EcoRI, to give the plasmid 
pSLIA. Finally, the hygromycin resistance cassette was excised 
Spel-Pstl from pLHyg and cloned into pSLIA which had been 
digested with Nsil-Spel to give plasmid pSLIrl. 

The replacement vector pSLIrl was introduced into S. 
parvulus Tu4 055 by protoplast transformation as described in 
example 5. Colonies resistant to apramycin (25 >ig/ml) were 
selected, and then passaged several times through MA media 
without selection. The replacement was verified by Southern 
25 hybridisation and the new mutant was named S. parvulus 
Tu4055/JborJ; :aac3(IV) • 

S. parvulus Tu4 055 /JborJ ; :aac3 (IV) was grown and analysed 
as described in example 1. No borrelidin production was observed 
whereas several new compounds were observed at significantly 
30 lower levels. One of the less polar compounds displayed a UV 

absorbance maximum of 240 nm, and LCMS analysis indicated an m/z 
ratio 11 mass units lower than that for borrelidin, which is 
consistent with the presence of a methyl- rather than a nitrile- 
group at C12. As a final control the strain was tested by in 
35 trans complementation with a full-length copy of £orJ under the 
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control of ermE* (in a manner as described in example 12) which 
re-established an approximately WT titre of borrelidin. 

Example 15: Disruption of JborJ (S. parvulus 
Tu4055/borJ: :aac3 (IV) ) 

The gene borJ and surrounding DNA was amplified from 
cosBorl9B9 using the PCR primers BNHTl (5 1 -GTCATGCATCAGCGCACCCG- 
3 f ) (SEQ ID No. 88) and BNHT2 (5 1 -GTGCAATTGCCCTGGTAGTC— 3 1 ) (SEQ 
ID No. 89) . The 2,75 kbp product was purified, digested with 
Nsil-Mfel and cloned into pSL1180 that had been digested with 
NsiI-£7coRI, to give plasmid pSL. The hygromycin resistance 
cassette was excised from pLHyg as a Pstl-Spel fragment and 
cloned into pSL digested with Nsil-Spel, to give pSLJH. Finally , 
the apramycin resistance cassette was excised from pEFBA with 
Spel-BamHI and cloned into pSLJH that had been pre-digested with 
Avrll-Sglll in order to remove a 453 bp fragment from borJ, to 
give plasmid pSLJrl. 

The replacement vector pSLJrl. was introduced into S. 
parvulus Tu4055 by protoplast transformation as described in 
example 5. Colonies resistant to. apramycin (25 pg/ml) were 
selected, and then passaged several times through MA media 
without selection. The replacement was verified by Southern 
hybridisation. The new mutant was named 5. parvulus 
Tii4055/borJ: :aac3 (IV) . 

S. parvulus Tu4055/borJ"; :aac3 (IV) was grown and analysed 
as described in example 1. No borrelidin production was observed 
whereas a new compound more polar than borrelidin was observed 
with a UV maximum at 262 run. LCMS analysis indicated a parent 
compound of 508 amu, which is consistent with a carboxylic acid 
rather than a nitrile function at C12 . As a final control the 
strain was tested by in trans complementation with a full-length 
copy of borJ under the control of ermE* (in a manner as 
described in example 12) which re-established an approximately 
WT titre of borrelidin. 
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Example 16; Effects of borE up- regulation in S . parvulus Tii4055 
To examine the possibility that biosynthesis of the trans^ 
1,2-cyclopentane dicarboxylic acid starter unit may have a 

* 

limiting effect upon borrelidin production, borE was up- 
5 regulated in the parental strain and the effect upon borrelidin 
titre was analysed. The vector used, pborEH was described in 
example 12 . 

The vectors pborEH and pEM4 (control) were used to 
transform protoplasts of S. parvulus TQ4055 to give strains S. 
10 parvulus Tu4055 /pborEH and S. parvulus Tu4055/pEM4 respectively. 
Several colonies from each transformation were picked, grown in 
triplicate and then analysed as described in example 1. Compared 
to the control strain, up-regulation of borE brought about a 
4. 2±0. 3-fold increase in the titre of borrelidin. 




To examine the possibility that borL may have a 
regulatory, or some other related function involved in 
borrelidin production, the gene was up-regulated in the parental 
20 strain and the effect upon borrelidin titre was analysed. 

The expression vector pborLH was generated as follows: 
pSLL was digested with NotI, treated with Klenow fragment and 
then digested with BamHI to obtain a fragment of 2190 bp 
containing borL. This fragment was sub-cloned together with the 
25 BamHI -Spel hyg gene from pLHyg, into pEM4 digested with 
PstI (treated with Klenow) -Xbal, to obtain pborLH. 

The vectors pborLH and pEM4 (control) were used to 
transform protoplasts of S. parvulus Tu4055 to give strains S. 
parvulus Ttt4055/pborLH and S. parvulus Tu4055/pEM4 respectively. 
30 Several colonies from each transformation were picked, grown in 
triplicate and then analysed as described in example 1. Compared 
to the control strain, up-regulation of borL brought about a 
4. 3±0. 7-fold increase in the titre of borrelidin. 
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Example 18: Production of 12-desnitrile-12-methyl borrelidin 14 
(pre-borrelidin) 

Working stocks of S. parvulus Tu4055/JborX: :aac3 ( IV) (0.5 
ml) were inoculated into primary vegetative pre-cultures of NYG 

.5 as described in example 1. Secondary pre-cultures were prepared 
(as example 1 but with 250 ml NYG in 2 1 Erlenmeyer flasks) . 
PYDG production medium (4 1), prepared as in example 1 and with 
0.01 % Plutronic L0101 added to control foaming, was inoculated 
with secondary pre-culture (12.5 % inoculum) . A second fermenter 

10 containing centre-point medium (4 1) and 0.01 % Plutronic L0101 
to control .foaming, was set up in parallel and was also 
inoculated with secondary pre-culture (12.5 % inoculum) . Centre- 
point production medium contains per litre of deionised water: 
Tesco's skimmed milk powder (1.5 %) , Avidex W-80 (4.5 %) , 

15 glucose (0.5 %) and yeast autolysate (0.15 %) adjusted to pH 7.0 
with 5 M NaOH. 

These batches were each allowed to ferment in a 7 1 
Applikon fermenter for 6.5 days at 30 °C. Airflow was set at 
0.7 5 vvm (volume per volume per minute) > with tilted baffles and 

20 the impeller speed controlled between 400 and 800 rpm to 
maintain dissolved oxygen tension at or above 30 % of air 
saturation. No further antifoam was added. At 22 hours into the 
fermentation the starter acid, trans-cyclopentane-1, 2- 
dicarboxylic acid, was added as a neutralised solution of 1:1 

25 MeOH / 5 M NaOH, through an in-line filter (0.22 |im) . The final 
concentration in the fermenter vessel of exogenous starter acid 
was 0.5 mM. 

After 6 . 5 days of fermentation the broths were combined 
and acidified to pH 3.5 with concentrated HC1 (- 6 ml), then 
30 clarified by centrifugation at 3,500 rpm for 10 minutes. The 
supernatant was extracted into ethyl acetate (3x1 volume 
equivalent for 4 hours each) and the cell pellet left to steep 
in methanol (2 x 1.5 litres for 4 hours each). The organics were 
combined and removed under reduced pressure to yield a tarry 
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gum. The gum was re-suspended in 0.1 M Borax buffer (500 ml at 
P H 9.4) and washed with hexanes (500 ml) and ethyl acetate (500 
ml) . The aqueous layer was then acidified with concentrated HC1 
to pH 3.5 and extracted with ethyl acetate (3 x 500 ml), which 
5 were combined and taken to . dryness. The resultant gum was 

dissolved in methanol (15 ml), diluted with water (285 ml) and 
loaded under gravity onto a C 18 -reversed-phase cartridge (50 g, 
prepared in 5 % aqueous methanol) . The cartridge was washed with 
20 % and 50 % aqueous methanol (300 ml each) and eluted with 100 
10 % methanol (500 ml) . This last fraction was taken to dryness 

under reduced pressure to yield a black gummy-oil (600 mg) that 
was taken up in methanol. This residue was finally purified by 
sequential preparative reversed-phase HPLC (eluted with the 
mobile phases used in example 4, without added TFA, running 
15 isocratically at 40 %B) . Active fractions were' combined and 

desalted on a C 18 -cartridge (1 g) , to yield 28 mg of a dark oil 
(3.5 mg/1 isolated yield). Table 11 summarises the X H and 13 C NMR 
chemical shift data for 12-desnitrile-12-methyl borrelidin 14 in 

CDC1 3 . 

20 Table 11 
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Chemical shifts are referenced to CDC1 3 (for H at 7.26 ppm and 
for 13 C at 77.0 ppm) 



Example 19: Production of 12-desnitrile-12-carboxy borrelidin 2 

89 



Working stocks of S. parvulus Ttt4 0 55 /bor J: : a ac3 (IV) (0.5 
ml) were inoculated into primary vegetative pre-cultures of NYG 
as described in example 1. Secondary pre-cultures were prepared 
(as example 1 but with 250 ml NYG in 2 1 Erlenmeyer flasks) . 
5 PYDG production media (4 L) , prepared as in example 1 and with 
0.01 % Plutronic L0101 added to control foaming, was inoculated 
with the entire secondary pre-culture (10 % inoculum). This was 
allowed, to ferment in a 7 L Applikon fermenter for 6 days at 30 
°C. Airflow was set at 0.75 wm, with tilted baffles and the 
10 impeller speed controlled between 250 and 600 rpm to maintain 

dissolved oxygen tension at or above 30 % of air saturation. No 
further antifoam was added. A second fermentation was performed 
exactly as above, but which was batch fed with 0.2 mol of 
glucose as an aqueous solution every 12 hours from 60 hours 
15 post-inoculation. 

After 6 days the fermentations were harvested and 
combined. The broth was clarified by centrifugation (3,500 rpm, 
10 minutes) and the resultant supernatant acidified with 10 M 
HC1 (aq) to pH - 3.5. This solution was then extracted into 
20 ethyl acetate by stirring (3x1 volume equivalent for 4 hours 

each) . The cell pellet was extracted twice by steeping the cells 
in 1:1 methanol / ethyl acetate (500 ml). All the organics were 
combined and removed under reduced pressure to yield an aqueous 
slurry. The slurry was diluted to 500 ml with water, acidified 
25 to pH - 3.5 with 10 M HCl and extracted into ethyl acetate (3 x 
300 ml) . The organics were concentrated under reduced pressure 
to ~ 300 ml and extracted with 0.1 M borax (3 x 150 ml, pH = 
9.4) . The combined borax solutions were acidified with 10 M HCl 
to pH - 3.5 and extracted with 6 x 300 ml of ethyl acetate. 
30 Analytical HPLC demonstrated that some of the accumulant still 
resided in the borax solution and so this was loaded, under 
gravity, onto a C 18 -reverse-phase cartridge (50 g) . The cartridge 
was washed with water and the accumulant eluted in 100 % 
methanol. The organics containing the accumulant were combined 
35 and reduced to a 40 ml methanolic solution. This was loaded onto 
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a Sephadex LH-20 column (70 g, swelled overnight in methanol, 
column 60 cm x 2.5 cm), which was developed with 100 % methanol; 
the active fractions were combined and taken to dryness. The 
material was then further processed by preparative reversed- 
5 phase HPLC (eluted with the mobile phases used in example 4, 

without added TFA, running isocraticaily at 40 %B) . The combined 
active fractions were taken to dryness, dissolved in methanol (4 
ml) and diluted with water (200 ml) . This mixture was split into 
2 equal fractions and each loaded, under gravity, onto a Ci 8 - 

10 reverse-phase cartridge (20 g) . The columns were then eluted 

with 3 column volumes of 5 %, 10 %, 20 %, 30 %, 40 %, 50 %, 60 
%, 75 %, 90 % and 100 % aqueous methanol. The accumulant eluted 
in all fractions from 60 % to 100 % methanol, which were 
combined and taken to dryness. The accumulant (dissolved in . 

15 DMSO) was then finally purified by sequential preparative 
reversed-phase HPLC (eluted with the mobile phases used in 
example 4, without added TFA, running isocraticaily at 40 %B) . 
Active fractions were combined and desalted on a C 18 -cartridge (1 
g) , to yield 17 mg of a brown oil (2.1 mg/1 isolated yield). 

20 Table 12 summarises the X H and 13 C NMR chemical shift data for 
12-desnitrile-12-carboxy borrelidin 2 in da-methanol. 
Table 12 
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Chemical shifts are referenced to methanol (for l H at 3.35 ppm 
(quintet) and for 13 C at 49.0 ppm (septet)); *0bscured by solvent 
signal, d4-methanol. 

5 Example 20; Heterologous expression of borO in Streptomyces 
albus J1074 
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In order to examine whether the putative resistance 
protein BorO confers resistance to a borrelidin-sensitive 
organism, JborO was expressed in Streptomyces albus J1074. The 
gene borL was amplified by PCR using the primers BTRNAS1 (5 f - 
TGTCTAGACTCGCGCGAACA-3 ' ) (SEQ ID No, 90) and BTRNAS2 (5 1 - 
TGAATTCCGAAGGGGGTGGT-3 1 ) (SEQ ID No. 91) with cosBorl9B9 as 
template. The product was purified, digested Xbal-EcoRI and 
cloned into pEM4A that had been similarly digested to give 
plasmid pborOR which puts JborO under the control of the promoter 
ermE*. The vector pborOR was introduced into S. albus J1074 by. 
protoplast transformation (Chater & Wilde, 1980) and selected 
for apramycin resistance. The new strain was named S. albus 
J107 4 /pborOR. 

Resistance to. borrelidin was assayed on Bennett's agar 
containing apramycin at 25 jig/ml. Spores of S. albus 
J1074/pborOR and the control S. albus J1074/pEM4A were spread 
onto plates and then disks containing borrelidin at 100 & 200 
lig/ml were laid upon the lawn of spores and incubated overnight 
at 30 °C. Haloes indicating inhibition of growth were observed 
for the control strain harbouring pEM4A but not for 5. albus 
J1074/ pborOR. 
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